DESCRIPTION

NUCLEIC ACID TREATMENT OF DISEASES OR CONDITIONS RELATED TO
LEVELS OF RAS, HER2 AND HIV

This patent application claims priority from McSwiggen USSN 60/294,140, filed May
29, 2001, entitled "Enzymatic Nucleic Acid Treatment of Diseases or Conditions Related To
Levels of HIV," McSwiggen USSN 60/296,249, filed June 6, 2001, entitled "Enzymatic
Nucleic Acid Treatment of Diseases or Conditions Related to Levels of HER2," and
McSwiggen USSN 60/318,471, filed September 10, 2001, entitled "Enzymatic Nucleic Acid
Treatment of Diseases or Conditions Related to Levels of RAS." Each of these applications is
hereby incorporated by reference herein in its entirety, including the drawings and tables.

Technical Field Of The Invention 

The present invention relates to novel nucleic acid compounds and methods for the
treatment or diagnosis of diseases or conditions related to levels of Ras gene expression, such
as K-Ras, H-Ras, and/or N-Ras expression, HIV infection such as HIV-1, and HER2 gene
expression.

Background Of The Invention 

Transformation is a cumulative process whereby normal control of cell growth and
differentiation is interrupted, usually through the accumulation of mutations affecting the
expression of genes that regulate cell growth and differentiation.

The platelet derived growth factor (PDGF) system has served as a prototype for
identification of substrates of the receptor tyrosine kinases. Certain enzymes become
activated by the PDGF receptor kinase, including phospholipase C and phosphatidylinositol 3'
kinase, Ras guanosine triphosphatase (GTPase) activating protein (GAP) and src-like tyrosine
kinases. GAP regulates the function of the Ras protein by stimulating the GTPase activity of
the 21 kD Ras protein. Barbacid, 56 Ann. Rev. Biochem. 779, 1987. Microinjection of
oncogenically activated Ras into NIH 3T3 cells has been shown to induce DNA synthesis.
Mutations that cause oncogenic activation of Ras lead to accumulation of Ras bound to GTP,
the active form of the molecule. These mutations block the ability of GAP to convert Ras to
the inactive form. Mutations that impair the interactions of Ras with GAP also block the
biological function of Ras.

While a number of Ras alleles exist (N-Ras, K-Ras, H-Ras) which have been
implicated in carcinogenesis, the type most often associated with colon and pancreatic
carcinomas is K-Ras. Enzymatic nucleic acid molecules which are targeted to certain regions
of the K-Ras allelic mRNAs may also prove inhibitory to the function of the other allelic
mRNAs of the N-Ras and H-Ras genes.

Scanlon, International PCT Publication Nos. WO 91/18625, WO 91/18624, and WO
91/18913 describes a ribozyme effective to cleave oncogene RNA from the H-Ras gene. This
ribozyme is said to inhibit H-ras expression in response to exogenous stimuli. Reddy,
WO 92/00080, describes the use of ribozymes as therapeutic agents for leukemias, such as
chronic myelogenous leukemia (CML), by targeting specific portions of the BCR-ABL gene
transcript.

Thompson et al., International PCT Publication No. WO 99/54459, describe nucleic
acid molecules that modulate gene expression, including Ras gene expression.

Zhang et al, 2000, Gene Ther., 7, 2041; Takunaga et al, 2000, Br. J. Cancer., 83, 833;
Zhang et al, 2000, Mol. Biotechnol., 15, 39; Irie et al, 2000, Mol. Urol., 4, 61; Kijima and
Scanlon, 2000, Mol. Biotechnol., 14, 59; Funato et al, 2000, Cancer Gene Ther., 7, 495;
Tsuchida et al, 2000, Cancer Gene Ther., 7, 373; Zhang et al, 2000, Methods Mol. Med., 35,
261; Irie et al, 1999, Antisense Nucleic Acid Drug Dev., 9, 341; Giannini et al, 1999,
Nucleic Acids Res., 27, 2737; Fang et al, 1999, J. Med. Coll. PLA, 14, 25; Tong et al, 1998,
Methods Mol. Med., 11, 209; Ohkawa and Kashani-Sabet, 1998, Methods Mol. Med., 11, 153;
Scherr et al, 1999, Gene Ther., 6, 152; Tsuchida et al, 1998, Biochem. Biophys. Res.
Commun., 252, 368; Scherr et al, 1998, Gene Ther., 5, 1227; Uhlmann et al, European
Patent Application EP 808898; Scherr et al, 1997, J. Biol. Chem., 272, 14304; Chang et al,
1997, J. Cancer Res. Clin. Oncol., 123, 91; Ohta et al, 1996, Nucleic Acids Res., 24, 938;
Ohta et al, 1994, Ann. NY. Acad. Sci., 716, 242; and Funato et al, 1994, Biochem.
Pharmacol., 48, 1471 all describe specific ribozymes targeting certain K-Ras, H-Ras, or
N-Ras RNA sequences.

Todd, International PCT Publication Nos. WO 01/49877, WO 99/50452, and WO
99/45146 describes specific DNAzymes targeting K-Ras for diagnostic applications.

Acquired immunodeficiency syndrome (AIDS) is thought to be caused by infection
with the human immunodeficiency virus, for example HIV-1. Draper et al, U.S. Patent Nos.
6,159,692, 5,972,704, 5,693,535, and International PCT Publication Nos. WO 93/23569 and
WO 95/04818, describe enzymatic nucleic acid molecules targeting HIV. Todd et al,
International PCT Publication No. WO 99/50452, describe methods for using specific
DNAzyme motifs for detecting the presence of certain HIV RNAs. Sriram and Banerjea,
2000, Biochem J., 352, 667-673, describe specific RNA cleaving DNA enzymes targeting
HIV-1. Zhang et al, 1999, FEBS Lett., 458, 151-156, describe specific RNA cleaving DNA
enzymes used in the inhibition of HIV-1 infection.

HER2 (also known as neu, erbB2 and c-erbB2) is an oncogene that encodes a 185-kDa
transmembrane tyrosine kinase receptor. HER2 is a member of the epidermal growth factor
receptor (EGFR) family and shares partial homology with other family members. In normal
adult tissues HER2 expression is low. However, HER2 is overexpressed in at least 25-30%
of breast (McGuire, H.C. and Greene, M.I. (1989) The neu (c-erbB-2) oncogene. Semin.
Oncol. 16: 148-155) and ovarian cancers (Berchuck, A., Kamel, A., Whitaker, R. et al.
(1990) Overexpression of her-2/neu is associated with poor survival in advanced epithelial
ovarian cancer. Cancer Research 50: 4087-4091). Furthermore, overexpression of HER2 in
malignant breast tumors has been correlated with increased metastasis, chemoresistance and
poor survival rates (Slamon et al., 1987, Science 235: 177-182). Because HER2 expression is
high in aggressive human breast and ovarian cancers, but low in normal adult tissues, it is an
attractive target for enzymatic nucleic acid-mediated therapy. McSwiggen et al., International
PCT Publication No. WO 01/16312 and Beigelman et al, International PCT Publication No.
WO 99/55857 describe enzymatic nucleic acid molecules targeting HER2. Thompson and
Draper, US Patent No. 5,599,704, describe enzymatic nucleic acid molecules targeting
HER2 (erbB2/neu) gene expression.

Summary Of The Invention 

The present invention features nucleic acid molecules, including, for example, antisense
oligonucleotides, siRNA, aptamers, decoys and enzymatic nucleic acid molecules such as
DNAzyme enzymatic nucleic acid molecules, which modulate expression of nucleic acid
molecules encoding Ras oncogenes, such as K-Ras, H-Ras, and N-Ras. In one embodiment,
the invention features an enzymatic nucleic acid molecule comprising a sequence selected
from the group consisting of SEQ ID NOs: 2329-4655.

In another embodiment, the invention features an enzymatic nucleic acid molecule
comprising at least one binding arm having a sequence complementary to a sequence selected 
from the group consisting of SEQ ID NOs: 1-2328. 

In another embodiment, the invention features a siRNA molecule having 
complementarity to a sequence selected from the group consisting of SEQ ID NOs: 1-2328. 

In another embodiment, the invention features an antisense molecule having 
complementarity to a sequence selected from the group consisting of SEQ ID NOs: 1-2328. 

In another aspect of the invention, the nucleic acid of the invention is adapted to treat 
cancer. 

In one embodiment, the enzymatic nucleic acid molecule of the invention has an 
endonuclease activity to cleave RNA having a K-Ras sequence. 

In another embodiment, the enzymatic nucleic acid molecule of the invention has an 
endonuclease activity to cleave RNA having an H-Ras sequence. 

In another embodiment, the enzymatic nucleic acid molecule of the invention has an 
endonuclease activity to cleave RNA having an N-Ras sequence.

In one embodiment, the siRNA molecule of the invention has RNA interference activity 
to K-Ras expression. 

In another embodiment, the siRNA molecule of the invention has RNA interference 
activity to H-Ras expression. 

In another embodiment, the siRNA molecule of the invention has RNA interference 
activity to N-Ras expression. 

In one embodiment, a siRNA molecule of the invention comprises a double stranded 
RNA wherein one strand of the RNA is complementary to the RNA of K-Ras, H-Ras, and/or 
N-Ras gene. In another embodiment, a siRNA molecule of the invention comprises a double 
stranded RNA wherein one strand of the RNA comprises a portion of a sequence of RNA of 
K-Ras, H-Ras, and/or N-Ras gene sequence. In yet another embodiment, a siRNA molecule 
of the invention comprises a double stranded RNA wherein both strands of RNA are 
connected by a non-nucleotide linker. Alternately, a siRNA molecule of the invention 
comprises a double stranded RNA wherein both strands of RNA are connected by a 
nucleotide linker, such as a loop or stem loop structure.

In one embodiment, a single strand component of a siRNA molecule of the invention is
from about 14 to about 50 nucleotides in length. In another embodiment, a single strand
component of a siRNA molecule of the invention is about 14, 15, 16, 17, 18, 19, 20, 21, 22,
23, 24, 25, 26, 27, or 28 nucleotides in length. In yet another embodiment, a single strand
component of a siRNA molecule of the invention is about 23 nucleotides in length. In one
embodiment, a siRNA molecule of the invention is from about 28 to about 56 nucleotides in
length. In another embodiment, a siRNA molecule of the invention is about 40, 41, 42, 43,
44, 45, 46, 47, 48, 49, 50, 51, or 52 nucleotides in length. In yet another embodiment, a
siRNA molecule of the invention is about 46 nucleotides in length.
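
The length ranges recited above can be illustrated with a minimal Python sketch. It is not part of the specification: the helper names and the placeholder sequence are hypothetical, and it assumes that the overall siRNA length is the sum of the two single strand components, which is one reading consistent with the recited figures of about 23 and about 46 nucleotides.

```python
# Illustrative sketch only; names and the example sequence are hypothetical.
RNA_PAIRS = {"A": "U", "U": "A", "G": "C", "C": "G"}


def antisense_strand(sense_rna: str) -> str:
    """Return the antiparallel RNA complement of sense_rna, written 5'->3'."""
    return "".join(RNA_PAIRS[base] for base in reversed(sense_rna.upper()))


def strand_length_ok(strand: str, minimum: int = 14, maximum: int = 50) -> bool:
    """Check a single strand component against the 14-50 nucleotide range."""
    return minimum <= len(strand) <= maximum


def total_length_ok(sense: str, antisense: str,
                    minimum: int = 28, maximum: int = 56) -> bool:
    """Check the combined length of both strands against the 28-56 range.

    Assumption: total length is taken as the sum of the two strands.
    """
    return minimum <= len(sense) + len(antisense) <= maximum


# Arbitrary 23-nucleotide placeholder, not a sequence from the sequence listing.
sense = "ACGU" * 5 + "ACG"
antisense = antisense_strand(sense)

print(len(sense), len(antisense), len(sense) + len(antisense))
print(strand_length_ok(sense), total_length_ok(sense, antisense))
```

A strand connected to its partner through a nucleotide or non-nucleotide linker would need the linker length handled separately; the sketch does not model linkers.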

In one embodiment, the DNAzyme molecule of the invention is in a "10-23" 
configuration (see for example Santoro et al, 1997, PNAS, 94, 4262 and Joyce et al, US 
5,807,718). In another embodiment, the DNAzyme comprises a sequence complementary to a 
sequence selected from the group consisting of SEQ ID NOs: 1-2328. In yet another 
embodiment, the DNAzyme comprises a sequence selected from the group consisting of SEQ 
ID NOs: 2329-4655. 
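
As a rough illustration of how a "10-23" DNAzyme relates to its target, the following Python sketch assembles two binding arms around the 15-nucleotide catalytic core reported by Santoro et al. It is a sketch under stated assumptions, not the procedure used to generate the sequences of the invention: the function names, the arm length of 7, and the placeholder target are hypothetical, and the cleavage site is assumed to be an unpaired purine followed by a paired pyrimidine.

```python
# Illustrative sketch only; function names, arm length and target are hypothetical.
CORE_10_23 = "GGCTAGCTACAACGA"  # catalytic core of the 10-23 motif (15 nt)
DNA_FOR_RNA = {"A": "T", "U": "A", "G": "C", "C": "G"}


def dna_revcomp_of_rna(rna: str) -> str:
    """Return the DNA strand (5'->3') complementary to an RNA segment."""
    return "".join(DNA_FOR_RNA[base] for base in reversed(rna))


def design_10_23(target_rna: str, site: int, arm_len: int = 7) -> str:
    """Build a 10-23 style DNAzyme for the purine at index `site`.

    Assumptions: target_rna[site] is the unpaired purine, target_rna[site + 1]
    is a pyrimidine that pairs with the enzyme, and both arms have arm_len bases.
    """
    target = target_rna.upper()
    if site - arm_len < 0 or site + 1 + arm_len > len(target):
        raise ValueError("not enough flanking sequence for the binding arms")
    if target[site] not in "AG" or target[site + 1] not in "CU":
        raise ValueError("site must be a purine followed by a pyrimidine")
    upstream = target[site - arm_len:site]            # 5' of the unpaired purine
    downstream = target[site + 1:site + 1 + arm_len]  # starts at the pyrimidine
    # The enzyme's 5' arm pairs with the target's downstream segment and its
    # 3' arm pairs with the upstream segment (antiparallel binding).
    return dna_revcomp_of_rna(downstream) + CORE_10_23 + dna_revcomp_of_rna(upstream)


# Placeholder target: 7 nt upstream, purine A, pyrimidine U, 6 more nt downstream.
target = "CCCCCCC" + "A" + "UGGGGGG"
dnazyme = design_10_23(target, site=7)
print(dnazyme)
```

Each arm in the output is complementary to one flank of the target, which corresponds to the "binding arm having a sequence complementary to" language used in the embodiments above.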

In another embodiment, the nucleic acid molecule of the invention comprises between 
12 and 100 bases complementary to a nucleic acid molecule having a K-Ras sequence. In yet 
another embodiment, the enzymatic nucleic acid comprises between 14 and 24 bases 
complementary to a nucleic acid molecule having a K-Ras sequence. 

In another embodiment, the nucleic acid molecule of the invention comprises between 
12 and 100 bases complementary to a nucleic acid molecule having an H-Ras sequence. In 
yet another embodiment, the nucleic acid molecule of the invention comprises between 14 
and 24 bases complementary to a nucleic acid molecule having an H-Ras sequence. 

In another embodiment, the nucleic acid molecule of the invention comprises between 
12 and 100 bases complementary to a nucleic acid molecule having an N-Ras sequence. In 
yet another embodiment, the nucleic acid molecule of the invention comprises between 14 
and 24 bases complementary to a nucleic acid molecule having an N-Ras sequence. 
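
The complementarity ranges recited for K-Ras, H-Ras and N-Ras can be checked with a short Python sketch. This is an illustration under assumptions, not a definition from the specification: the helper names are hypothetical, only Watson-Crick pairs are counted, the oligonucleotide is aligned antiparallel to a single window of the target, and "between 12 and 100" is read as inclusive.

```python
# Illustrative sketch only; helper names and sequences are hypothetical.
# Each tuple is (oligonucleotide base, target RNA base) for a Watson-Crick pair.
WC_PAIRS = {("A", "U"), ("U", "A"), ("T", "A"), ("G", "C"), ("C", "G")}


def count_complementary(oligo: str, target_rna: str, start: int = 0) -> int:
    """Count Watson-Crick pairs between oligo and a window of target_rna.

    The oligo (DNA or RNA, 5'->3') is aligned antiparallel to the window
    target_rna[start:start + len(oligo)].
    """
    window = target_rna.upper()[start:start + len(oligo)]
    return sum((o, t) in WC_PAIRS
               for o, t in zip(oligo.upper(), reversed(window)))


def in_recited_range(n_bases: int, minimum: int = 12, maximum: int = 100) -> bool:
    """Assumed inclusive reading of 'between 12 and 100 bases'."""
    return minimum <= n_bases <= maximum


# Arbitrary 12-nucleotide placeholder target and a fully complementary DNA oligo.
example_target = "AUGGCCAUUGCA"
example_oligo = "TGCAATGGCCAT"
n = count_complementary(example_oligo, example_target)
print(n, in_recited_range(n))
```

Calling `in_recited_range(n, 14, 24)` applies the narrower 14 to 24 base range recited in the same embodiments.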

In yet another embodiment, the nucleic acid molecule of the invention is chemically
synthesized. The nucleic acid molecule can comprise at least one 2'-sugar modification, at
least one nucleic acid base modification, and/or at least one phosphate backbone
modification.

In one embodiment, the invention features a mammalian cell comprising the nucleic
acid molecule of the invention. In another embodiment, the mammalian cell of the invention
is a human cell.

In another embodiment, the invention features a method of modulating K-Ras activity 
in a cell, comprising contacting the cell with the nucleic acid molecule of the invention, under 
conditions suitable for the modulation of K-Ras activity. 

In another embodiment, the invention features a method of modulating H-Ras activity 
in a cell, comprising contacting the cell with the nucleic acid molecule of the invention, under 
conditions suitable for the modulation of H-Ras activity. 

In another embodiment, the invention features a method of modulating N-Ras activity 
in a cell, comprising contacting the cell with the nucleic acid molecule of the invention, under 
conditions suitable for the modulation of N-Ras activity. 

In another embodiment, the invention features a method of treatment of a subject 
having a condition associated with the level of K-Ras, comprising contacting cells of the 
subject with the nucleic acid molecule of the invention, under conditions suitable for the 
treatment. 

In another embodiment, the invention features a method of treatment of a subject 
having a condition associated with the level of H-Ras, comprising contacting cells of the 
subject with the nucleic acid molecule of the invention, under conditions suitable for the 
treatment. 

In another embodiment, the invention features a method of treatment of a subject 
having a condition associated with the level of N-Ras, comprising contacting cells of the 
subject with the nucleic acid molecule of the invention, under conditions suitable for the 
treatment. 

In one embodiment, a method of treatment of the invention further comprises the use of 
one or more drug therapies under conditions suitable for the treatment. 

In another embodiment, the invention features a method of cleaving RNA having a
K-Ras sequence comprising contacting the K-Ras RNA with the enzymatic nucleic acid
molecule of the invention under conditions suitable for the cleavage, for example, where the
cleavage is carried out in the presence of a divalent cation, such as Mg2+.

In another embodiment, the invention features a method of cleaving RNA having an
H-Ras sequence comprising contacting the H-Ras RNA with the enzymatic nucleic acid
molecule of the invention under conditions suitable for the cleavage, for example, where the
cleavage is carried out in the presence of a divalent cation, such as Mg2+.

In another embodiment, the invention features a method of cleaving RNA having an
N-Ras sequence comprising contacting the N-Ras RNA with the enzymatic nucleic acid
molecule of the invention under conditions suitable for the cleavage, for example, where the
cleavage is carried out in the presence of a divalent cation, such as Mg2+.

In one embodiment, the nucleic acid molecule of the invention comprises a cap
structure, for example, a 3',3'-linked or 5',5'-linked deoxyabasic ribose derivative, wherein
the cap structure is at the 5'-end, 3'-end, or both the 5'-end and the 3'-end of the nucleic acid
molecule.

In another embodiment, the invention features an expression vector comprising a
nucleic acid sequence encoding at least one nucleic acid molecule of the invention in a
manner that allows expression of the nucleic acid molecule. For example, the invention
features an expression vector comprising a nucleic acid encoding a DNAzyme in a manner
that allows expression of the DNAzyme.

In yet another embodiment, the invention features a mammalian cell, for example a 
human cell, comprising an expression vector of the invention. 

In another embodiment, the expression vector of the invention further comprises a
sequence for a nucleic acid molecule complementary to an RNA having K-Ras sequence.

In another embodiment, the expression vector of the invention further comprises a 
sequence for a nucleic acid molecule complementary to an RNA having H-Ras sequence. 

In another embodiment, the expression vector of the invention further comprises a
sequence for a nucleic acid molecule complementary to an RNA having N-Ras sequence.

In one embodiment, an expression vector of the invention comprises a nucleic acid
sequence encoding two or more nucleic acid molecules of the invention, which can be the
same or different. In another embodiment, an expression vector of the invention further
comprises a sequence encoding an antisense nucleic acid molecule complementary to an RNA
having a K-Ras, H-Ras or N-Ras sequence.

In another embodiment, the invention features a method for treating cancer, for
example colorectal cancer, bladder cancer, lung cancer, pancreatic cancer, breast cancer, or
prostate cancer, comprising administering to a subject a nucleic acid molecule of the
invention under conditions suitable for the treatment. A method of treatment of cancer of the
invention can further comprise administering to a patient one or more other therapies, for
example, monoclonal antibody therapy, such as Herceptin (trastuzumab); chemotherapy, such
as paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, cyclophosphamide, doxorubicin,
fluorouracil, carboplatin, Leucovorin, Irinotecan (CAMPTOSAR® or CPT-11 or
Camptothecin-11 or Campto), Carboplatin, edatrexate, gemcitabine, or vinorelbine; radiation
therapy, or analgesic therapy and/or any combination thereof.

In another embodiment, the invention features a composition comprising a nucleic acid 
molecule of the invention in a pharmaceutically acceptable carrier. 

In one embodiment, the invention features a method of administering to a cell, for 
example a mammalian cell or human cell, the nucleic acid molecule of the invention 
comprising contacting the cell with the nucleic acid molecule under conditions suitable for 
administration. The method of administration can be in the presence of a delivery reagent, for 
example a lipid, cationic lipid, phospholipid, or liposome. 

The present invention features an enzymatic nucleic acid molecule which modulates
expression of a nucleic acid molecule encoding a human immunodeficiency virus (HIV), for
example HIV-1, HIV-2, and related viruses such as FIV-1 and SIV-1, or an HIV gene, for
example LTR, nef, vif, tat, or rev, wherein the enzymatic nucleic acid molecule comprises a
DNAzyme configuration.

The invention also features an enzymatic nucleic acid molecule which modulates
expression of a nucleic acid molecule encoding HIV or a component of HIV such as nef, vif,
tat, or rev, wherein the enzymatic nucleic acid molecule is in an Inozyme, G-cleaver, Zinzyme,
DNAzyme or Amberzyme configuration.

The present invention also features a siRNA molecule which modulates expression of a
nucleic acid molecule encoding a human immunodeficiency virus (HIV), for example HIV-1,
HIV-2, and related viruses such as FIV-1 and SIV-1, or an HIV gene, for example LTR, nef,
vif, tat, or rev.

The present invention features an enzymatic nucleic acid molecule comprising a
sequence selected from the group consisting of SEQ ID NOs. 6727-6799. The invention also
features an enzymatic nucleic acid molecule comprising at least one binding arm wherein one
or more of said binding arms comprises a sequence complementary to a sequence selected
from the group consisting of SEQ ID NOs. 6642-6726. In addition, the present invention
features a siRNA nucleic acid molecule comprising sequence complementary to a sequence
selected from the group consisting of SEQ ID NOs. 1-76 and 140-148.

In another embodiment, the siRNA molecule of the invention has RNA interference 
activity to HIV-1 expression and/or replication. 

In one embodiment, a siRNA molecule of the invention comprises a double stranded 
RNA wherein one strand of the RNA is complementary to the RNA of HIV-1 genome or 
genes. In another embodiment, a siRNA molecule of the invention comprises a double 
stranded RNA wherein one strand of the RNA comprises a portion of a sequence of HIV-1 
genome or gene sequence. In yet another embodiment, a siRNA molecule of the invention 
comprises a double stranded RNA wherein both strands of RNA are connected by a non- 
nucleotide linker. Alternately, a siRNA molecule of the invention comprises a double 
stranded RNA wherein both strands of RNA are connected by a nucleotide linker, such as a 
loop or stem loop structure. 

In one embodiment, a single strand component of a siRNA molecule of the invention is 
from about 14 to about 50 nucleotides in length. In another embodiment, a single strand 
component of a siRNA molecule of the invention is about 14, 15, 16, 17, 18, 19, 20, 21, 22, 
23, 24, 25, 26, 27, or 28 nucleotides in length. In yet another embodiment, a single strand 
component of a siRNA molecule of the invention is about 23 nucleotides in length. In one 
embodiment, a siRNA molecule of the invention is from about 28 to about 56 nucleotides in 
length. In another embodiment, a siRNA molecule of the invention is about 40, 41, 42, 43, 
44, 45, 46, 47, 48, 49, 50, 51, or 52 nucleotides in length. In yet another embodiment, a 
siRNA molecule of the invention is about 46 nucleotides in length. 

In one embodiment, a nucleic acid molecule of the invention is adapted to treat HIV
infection or acquired immunodeficiency syndrome (AIDS). 

In another embodiment, the enzymatic nucleic acid molecule of the invention has an 
endonuclease activity to cleave RNA having HIV sequence. 

In yet another embodiment, the enzymatic nucleic acid molecule of the invention is in
an Inozyme, Zinzyme, G-cleaver, Amberzyme, DNAzyme or Hammerhead configuration.

In another embodiment, the Inozyme of the invention comprises a sequence
complementary to a sequence selected from the group consisting of SEQ ID NOs. 6648-6655,
or comprises a sequence selected from the group consisting of SEQ ID NOs. 6733-6740.

In another embodiment, the Zinzyme of the invention comprises a sequence
complementary to a sequence selected from the group consisting of SEQ ID NOs. 6656-6663
and 6723-6726, or comprises a sequence selected from the group consisting of SEQ ID NOs.
6741-6748 and 6795-6799.

In another embodiment, the Amberzyme of the invention comprises a sequence
complementary to a sequence selected from the group consisting of SEQ ID NOs. 6656-6688,
or comprises a sequence selected from the group consisting of SEQ ID NOs. 6762-6789.

In another embodiment, the DNAzyme of the invention comprises a sequence
complementary to a sequence selected from the group consisting of SEQ ID NOs. 6656-6668
and 6718-6722, or comprises a sequence selected from the group consisting of SEQ ID NOs.
6749-6761 and 6790-6794.

In another embodiment, the Hammerhead of the invention comprises a sequence
complementary to a sequence selected from the group consisting of SEQ ID NOs. 6642-6647,
or comprises a sequence selected from the group consisting of SEQ ID NOs. 6727-6732.

In one embodiment, a nucleic acid molecule of the invention comprises between 12 and
100 bases complementary to an RNA sequence encoding HIV genome, RNA, and/or proteins.
In another embodiment, a nucleic acid molecule of the invention comprises between 14 and
24 bases complementary to an RNA sequence encoding HIV genome, RNA, and/or proteins.

In yet another embodiment, a nucleic acid molecule of the invention is chemically
synthesized. A nucleic acid molecule of the invention can comprise at least one 2'-sugar
modification, at least one nucleic acid base modification, and/or at least one phosphate
backbone modification.

The present invention features a mammalian cell including a nucleic acid molecule of 
the invention. In one embodiment, the mammalian cell of the invention is a human cell. 

The invention features a method of reducing HIV activity in a cell, comprising
contacting the cell with a nucleic acid molecule of the invention, under conditions suitable for
the reduction of HIV activity.

The invention also features a method of treating a subject having a condition associated
with the level of HIV, comprising contacting cells of the subject with a nucleic acid molecule
of the invention, under conditions suitable for the treatment.

In one embodiment, methods of treatment contemplated by the invention comprise the
use of one or more drug therapies under conditions suitable for the treatment.

The invention features a method of cleaving RNA comprising an HIV nucleic acid
sequence comprising contacting an enzymatic nucleic acid molecule of the invention with the
RNA under conditions suitable for the cleavage. In one embodiment, the cleavage
contemplated by the invention is carried out in the presence of a divalent cation, for example
Mg2+.

The present invention features a method for treatment of acquired immunodeficiency
syndrome (AIDS) or an AIDS related condition, for example Kaposi's sarcoma, lymphoma,
cervical cancer, squamous cell carcinoma, cardiac myopathy, rheumatic disease, or
opportunistic infection, comprising administering to a subject a nucleic acid molecule of the
invention under conditions suitable for the treatment.

In one embodiment, a nucleic acid molecule of the invention comprises at least five
ribose residues, at least ten 2'-O-methyl modifications, and a 3'-end modification, for
example a 3'-3' inverted abasic moiety.

In another embodiment, a nucleic acid molecule of the invention further comprises
phosphorothioate linkages on at least three of the 5' terminal nucleotides.

In yet another embodiment, a DNAzyme of the invention comprises at least ten
2'-O-methyl modifications and a 3'-end modification, for example a 3'-3' inverted abasic moiety.
In a further embodiment, the DNAzyme of the invention further comprises phosphorothioate
linkages on at least three of the 5' terminal nucleotides.

In another embodiment, other drug therapies of the invention comprise antiviral
therapy, monoclonal antibody therapy, chemotherapy, radiation therapy, analgesic therapy, or
anti-inflammatory therapy.

In yet another embodiment, antiviral therapy of the invention comprises treatment with
AZT, ddC, ddI, d4T, 3TC, Ribavirin, delavirdine, nevirapine, efavirenz, ritonavir, saquinavir,
indinavir, amprenavir, nelfinavir, or lopinavir.

The invention features a composition comprising a nucleic acid molecule of the
invention in a pharmaceutically acceptable carrier.

In one embodiment, the invention features a method of administering to a cell, for
example a mammalian cell or human cell, an enzymatic nucleic acid molecule of the
invention comprising contacting the cell with the enzymatic nucleic acid molecule under
conditions suitable for the administration. The method of administration can be in the
presence of a delivery reagent, for example a lipid, cationic lipid, phospholipid, or liposome.

The present invention features enzymatic nucleic acid molecules which modulate 
expression of nucleic acid molecules encoding HER2. The present invention also features 
siRNA molecules which modulate the expression of nucleic acid molecules encoding HER2. 

In another embodiment, the invention features a siRNA molecule having 
complementarity to a sequence selected from the group consisting of SEQ ID NOs: 4656- 
5643 and 6632-6636. 

In one embodiment, the invention features an enzymatic nucleic acid molecule 
comprising a sequence selected from the group consisting of SEQ ID NOs: 5644-6631 and 
6637-6641. 

In another embodiment, the invention features an enzymatic nucleic acid molecule 
comprising at least one binding arm having a sequence complementary to a sequence selected 
from the group consisting of SEQ ID NOs: 4656-5643 and 6632-6636. 

In yet another embodiment, a nucleic acid of the invention is adapted to treat cancer. 

In another embodiment, an enzymatic nucleic acid molecule of the invention has an 
endonuclease activity to cleave RNA having HER2 sequence. 

In another embodiment, the siRNA molecule of the invention has RNA interference
activity to HER2 gene expression.

In one embodiment, a siRNA molecule of the invention comprises a double stranded
RNA wherein one strand of the RNA is complementary to the RNA of HER2 gene. In
another embodiment, a siRNA molecule of the invention comprises a double stranded RNA
wherein one strand of the RNA comprises a portion of a sequence of RNA having HER2
gene sequence. In yet another embodiment, a siRNA molecule of the invention comprises a
double stranded RNA wherein both strands of RNA are connected by a non-nucleotide linker.
Alternately, a siRNA molecule of the invention comprises a double stranded RNA wherein
both strands of RNA are connected by a nucleotide linker, such as a loop or stem loop
structure.

In one embodiment, a single strand component of a siRNA molecule of the invention is
from about 14 to about 50 nucleotides in length. In another embodiment, a single strand
component of a siRNA molecule of the invention is about 14, 15, 16, 17, 18, 19, 20, 21, 22,
23, 24, 25, 26, 27, or 28 nucleotides in length. In yet another embodiment, a single strand
component of a siRNA molecule of the invention is about 23 nucleotides in length. In one
embodiment, a siRNA molecule of the invention is from about 28 to about 56 nucleotides in
length. In another embodiment, a siRNA molecule of the invention is about 40, 41, 42, 43,
44, 45, 46, 47, 48, 49, 50, 51, or 52 nucleotides in length. In yet another embodiment, a
siRNA molecule of the invention is about 46 nucleotides in length.

In one embodiment, a DNAzyme molecule of the invention is in a "10-23"
configuration. In another embodiment, a DNAzyme of the invention comprises a sequence
complementary to a sequence having SEQ ID NOs: 4656-5643 and 6632-6636. In yet another
embodiment, a DNAzyme molecule of the invention comprises a sequence having SEQ ID
NOs: 5644-6631 and 6637-6641.

In another embodiment, a nucleic acid molecule of the invention comprises between 12
and 100 bases complementary to a nucleic acid molecule having HER2 sequence. In yet
another embodiment, a nucleic acid molecule of the invention comprises between 14 and 24
bases complementary to a nucleic acid molecule having HER2 sequence.

In yet another embodiment, a nucleic acid molecule of the invention is chemically
synthesized. A nucleic acid molecule of the invention can comprise at least one 2'-sugar
modification, at least one nucleic acid base modification, and/or at least one phosphate
backbone modification.

In one embodiment, the invention features a mammalian cell comprising a nucleic acid
molecule of the invention. In another embodiment, the mammalian cell of the invention is a
human cell.

In another embodiment, the invention features a method of reducing HER2 activity in a 
cell, comprising contacting the cell with the nucleic acid molecule of the invention, under 
conditions suitable for the reduction of HER2 activity. 

In another embodiment, the invention features a method of treatment of a subject 
30 having a condition associated with the level of HER2, comprising contacting cells of the 
subject with the nucleic acid molecule of the invention, under conditions suitable for the 
treatment.

In one embodiment, a method of treatment of the invention further comprises the use of
one or more drug therapies under conditions suitable for the treatment. 

In another embodiment, the invention features a method of cleaving RNA having HER2 
sequence comprising contacting an enzymatic nucleic acid molecule of the invention with the 
RNA under conditions suitable for the cleavage, for example, where the cleavage is carried 
out in the presence of a divalent cation, such as Mg2+. 

In one embodiment, a nucleic acid molecule of the invention comprises a cap structure, 
for example a 3',3'-linked or 5',5'-linked deoxyabasic ribose derivative, wherein the cap
structure is at the 5'-end, 3'-end, or both the 5'-end and the 3'-end of the enzymatic nucleic
acid molecule.

In another embodiment, the invention features an expression vector comprising a
nucleic acid sequence encoding at least one nucleic acid molecule of the invention, for
example a DNAzyme or siRNA molecule, in a manner that allows expression of the nucleic
acid molecule. 

In yet another embodiment, the invention features a mammalian cell, for example a
human cell, comprising an expression vector of the invention.

In another embodiment, an expression vector of the invention further comprises a
sequence for a nucleic acid molecule complementary to a nucleic acid molecule having HER2 
sequence. 

In one embodiment, an expression vector of the invention comprises a nucleic acid
sequence encoding two or more nucleic acid molecules, which can be the same or different. In
another embodiment, an expression vector of the invention further comprises a sequence
encoding an antisense nucleic acid molecule complementary to a nucleic acid molecule 
having a HER2 sequence. 

In another embodiment, the invention features a method for treating cancer, for
example breast cancer or ovarian cancer, comprising administering to a subject a nucleic acid
molecule of the invention under conditions suitable for the treatment. A method of treatment
of cancer of the invention can further comprise administering to a patient one or more other
therapies, for example, monoclonal antibody therapy, such as Herceptin (trastuzumab);
chemotherapy, such as paclitaxel (Taxol), docetaxel, cisplatin, methotrexate,
cyclophosphamide, doxorubicin, fluorouracil, carboplatin, Leucovorin, irinotecan
(CAMPTOSAR® or CPT-11 or Camptothecin-11 or Campto), edatrexate,
gemcitabine, or vinorelbine; radiation therapy, or analgesic therapy and/or any combination
thereof. 

In another embodiment, the invention features a composition comprising a nucleic acid 
molecule of the invention in a pharmaceutically acceptable carrier.

In one embodiment, the invention features a method of administering to a cell, for
example a mammalian cell or human cell, a nucleic acid molecule of the invention
comprising contacting the cell with the nucleic acid molecule under conditions suitable for 
administration. The method of administration can be in the presence of a delivery reagent, for 
example a lipid, cationic lipid, phospholipid, or liposome. 

Detailed Description of the Invention 
First the drawings will be described briefly. 
Drawings 

Figure 1 shows examples of chemically stabilized ribozyme motifs. HH Rz represents the
hammerhead ribozyme motif (Usman et al., 1996, Curr. Op. Struct. Bio., 1, 527); NCH Rz
represents the NCH ribozyme motif (Ludwig et al., International PCT Publication No. WO
98/58058 and US Patent Application Serial No. 08/878,640); G-Cleaver represents the
G-cleaver ribozyme motif (Kore et al., 1998, Nucleic Acids Research 26, 4116-4120; Eckstein et
al., US 6,127,173). N or n represent independently a nucleotide which can be same or
different and have complementarity to each other; rI represents ribo-Inosine nucleotide;
arrow indicates the site of cleavage within the target. Position 4 of the HH Rz and the NCH 
Rz is shown as having 2'-C-allyl modification, but those skilled in the art will recognize that 
this position can be modified with other modifications well known in the art, so long as such 
modifications do not significantly inhibit the activity of the ribozyme. 

Figure 2 shows an example of the Amberzyme ribozyme motif that is chemically
stabilized (see for example Beigelman et al., International PCT publication No. WO
99/55857 and US Patent Application Serial No. 09/476,387).

Figure 3 shows an example of a Zinzyme A ribozyme motif that is chemically 
stabilized (see for example Beigelman et al, International PCT publication No. WO 
99/55857 and US Patent Application Serial No. 09/918,728).

Figure 4 shows an example of a DNAzyme motif described by Santoro et al., 1997,
PNAS, 94, 4262 and Joyce et al., US 5,807,718.

The invention features novel nucleic acid molecules, including antisense 
oligonucleotides, siRNA and enzymatic nucleic acid molecules, and methods to modulate 
gene expression, for example, genes encoding K-Ras, H-Ras and/or N-Ras. In particular, the 
instant invention features nucleic-acid based molecules and methods to down-regulate the 
expression of K-Ras, H-Ras and/or N-Ras gene sequences. 

The invention features one or more nucleic acid-based molecules and methods that 
independently or in combination modulate the expression of a gene or genes encoding Ras 
proteins. In particular embodiments, the invention features nucleic acid-based molecules and 
methods that modulate the expression of K-Ras gene, for example, Genbank Accession No. 
NM_004985; H-Ras gene, for example, Genbank Accession No. NM_005343; and/or N-Ras 
gene, for example, Genbank Accession No. NM_002524. 

The description below of the various aspects and embodiments is provided with 
reference to exemplary K-Ras, H-Ras, and N-Ras genes, referred to hereinafter collectively as 
Ras. However, the various aspects and embodiments are directed to equivalent sequences and 
also to other genes which encode K-Ras, H-Ras and/or N-Ras proteins and similar proteins to 
K-Ras, H-Ras and/or N-Ras. For example, the invention relates to genes with homology to 
genes that encode K-Ras, H-Ras and/or N-Ras and genes that encode proteins with similar 
function to K-Ras, H-Ras, and N-Ras proteins. Those additional genes can be analyzed for 
target sites using the methods described herein. Thus, the modulation and the effects of such 
modulation of the other genes can be determined as described herein. 

In one embodiment, the invention features the use of an enzymatic nucleic acid 
molecule, preferably in the hammerhead, NCH, G-cleaver, amberzyme, zinzyme and/or 
DNAzyme motif, to modulate the expression of a Ras gene or inhibit Ras activity. In one 
embodiment, the invention features the use of these enzymatic nucleic acid molecules to 
down-regulate the expression of a Ras gene or inhibit Ras activity. In another embodiment, 
the invention features the use of an antisense oligonucleotide molecule to modulate, for 
example, down-regulate, the expression of a Ras gene or inhibit Ras activity. 

The invention features novel enzymatic nucleic acid molecules, siRNA molecules, and 
methods to modulate expression and/or activity of human immunodeficiency virus (HIV), for 
example HIV-1, HIV-2, and related viruses such as FIV-1 and SIV-1, or an HIV gene, for
example LTR, nef, vif, tat, or rev. In particular, the instant invention features nucleic-acid
based molecules and methods to inhibit the replication of an HIV or related virus.

The invention features one or more nucleic acid-based molecules and methods that 
independently or in combination modulate the expression of gene(s) encoded by HIV and/or 
inhibit the replication of HIV. In particular embodiments, the invention features nucleic acid-based
molecules and methods that modulate the expression of HIV-1 encoded genes, for
example (Genbank Accession No. AJ302647); HIV-2 gene, for example (Genbank Accession
No. NC_001722), FIV-1, for example (Genbank Accession No. NC_001482), SIV-1, for
example (Genbank Accession No. M66437), LTR, for example included in (Genbank
Accession No. AJ302647), nef, for example included in (Genbank Accession No. AJ302647),
vif, for example included in (Genbank Accession No. AJ302647), tat, for example included in
(Genbank Accession No. AJ302647), and rev, for example included in (Genbank Accession
No. AJ302647). 

The description below of the various aspects and embodiments is provided with 
reference to the exemplary HIV-1 gene, referred to herein as HIV. However, the various
aspects and embodiments are also directed to other genes which encode HIV proteins and
similar viruses to HIV. Those additional genes can be analyzed for target sites using the
methods described for HIV. Thus, the inhibition and the effects of such inhibition of the
other genes can be performed as described herein. 

Due to the high sequence variability of the HIV genome, selection of nucleic acid
molecules for broad therapeutic applications would likely involve the conserved regions of
the HIV genome. Specifically, the present invention describes nucleic acid molecules that
cleave the conserved regions of the HIV genome. Therefore, one nucleic acid molecule can
be designed to cleave all the different isolates of HIV. Nucleic acid molecules designed
against conserved regions of various HIV isolates can enable efficient inhibition of HIV
replication in diverse subject populations and can ensure the effectiveness of the nucleic acid 
molecules against HIV quasi species which evolve due to mutations in the non-conserved 
regions of the HIV genome. 
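
As an illustration only, the selection of conserved regions across isolates can be sketched in a few lines of Python. The sketch assumes the isolate sequences have already been aligned to equal length (gaps written as "-"); the function name conserved_windows, the min_len threshold, and the toy sequences are hypothetical and are not taken from this specification.

```python
from typing import List, Tuple


def conserved_windows(aligned: List[str], min_len: int) -> List[Tuple[int, int]]:
    """Return half-open (start, end) spans in which every aligned sequence
    carries the same nucleotide, keeping only spans of at least min_len."""
    if not aligned:
        return []
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be pre-aligned to equal length")

    spans = []
    run_start = None
    for i in range(length):
        column = {seq[i] for seq in aligned}
        # A column is conserved when all isolates agree and none has a gap.
        conserved = len(column) == 1 and "-" not in column
        if conserved and run_start is None:
            run_start = i
        elif not conserved and run_start is not None:
            if i - run_start >= min_len:
                spans.append((run_start, i))
            run_start = None
    if run_start is not None and length - run_start >= min_len:
        spans.append((run_start, length))
    return spans


# Toy, invented sequences standing in for aligned isolates.
isolates = [
    "AUGGCUAGCUAAGGCU",
    "AUGGCUAGCUUAGGCU",
    "AUGGCUAGCUAAGGCA",
]
print(conserved_windows(isolates, 8))
```

Target sites would then be sought only inside the returned spans, so that a single molecule pairs with every isolate in the alignment.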

In one embodiment, the invention features the use of an enzymatic nucleic acid 
molecule, preferably in the hammerhead, NCH, G-cleaver, amberzyme, zinzyme and/or
DNAzyme motif, to down-regulate the expression of HIV genes or inhibit the replication of
HIV.

The invention features novel nucleic acid molecules, siRNA molecules and methods to
modulate gene expression, for example, genes encoding HER2. In particular, the instant 
invention features nucleic-acid based molecules and methods to inhibit the expression of 
HER2. 

The invention features one or more nucleic acid-based molecules and methods that 
independently or in combination modulate the expression of a gene or genes encoding HER2. 
In particular embodiments, the invention features nucleic acid-based molecules and methods 
that modulate the expression of HER2 gene, for example, Genbank Accession No. 
NM_004448.

The description below of the various aspects and embodiments is provided with 
reference to an exemplary HER2 gene, referred to herein as HER2 but also known as ERB2, 
ERB-B2, NEU, NGL, and V-ERB-B2. However, the various aspects and embodiments are 
also directed to other genes which encode HER2 proteins and similar proteins to HER2. 
Those additional genes can be analyzed for target sites using the methods described for 
HER2. Thus, the inhibition and the effects of such inhibition of the other genes can be 
performed as described herein. 

In one embodiment, the invention features the use of an enzymatic nucleic acid 
molecule, preferably in the hammerhead, NCH, G-cleaver, amberzyme, zinzyme and/or 
DNAzyme motif, to down-regulate the expression of HER2 genes or inhibit HER2 activity. 

By "modulate" is meant that the expression of the gene, or level of RNAs or 
equivalent RNAs encoding one or more protein subunits or components, or activity of one or 
more proteins is up-regulated or down-regulated, such that the expression, level, or activity is 
greater than or less than that observed in the absence of the nucleic acid molecules of the 
invention. 

By "inhibit" or "down-regulate" it is meant that the expression of the gene, or level of 
RNAs or equivalent RNAs encoding one or more protein subunits or components, or activity 
of one or more protein subunits or components, such as Ras, HIV, and/or HER2 protein or 
proteins, is reduced below that observed in the absence of the nucleic acid molecules of the 
invention. In one embodiment, inhibition or down-regulation with the enzymatic nucleic acid 
molecule preferably is below that level observed in the presence of an enzymatically inactive 
or attenuated enzymatic nucleic acid molecule that is able to bind to the same site on the 
target RNA, but is unable to cleave that RNA. In another embodiment, inhibition or down-regulation
with an antisense oligonucleotide is preferably below that level observed in the
presence of, for example, an oligonucleotide with scrambled sequence or with mismatches. In
another embodiment, inhibition or down-regulation with an siRNA molecule is preferably
below that level observed in the presence of, for example, an oligonucleotide with scrambled
sequence or with mismatches. In another embodiment, inhibition or down-regulation of Ras,
HIV, or HER2 expression and/or activity with the nucleic acid molecule of the instant
invention is greater in the presence of the nucleic acid molecule than in its absence. 

By "up-regulate" is meant that the expression of the gene, or level of RNAs or 
equivalent RNAs encoding one or more protein subunits or components, or activity of one or 
more protein subunits or components, such as Ras, HIV, or HER2 protein or proteins, is
greater than that observed in the absence of the nucleic acid molecules of the invention. For 
example, the expression of a gene, such as Ras, HIV, or HER2 gene, can be increased in order 
to treat, prevent, ameliorate, or modulate a pathological condition caused or exacerbated by 
an absence or low level of gene expression. 

By "enzymatic nucleic acid molecule" as used herein, is meant a nucleic acid molecule
which has complementarity in a substrate binding region to a specified gene target, and also
has an enzymatic activity which is active to specifically cleave target RNA. That is, the
enzymatic nucleic acid molecule is able to intermolecularly cleave RNA and thereby
inactivate a target RNA molecule. These complementary regions allow sufficient
hybridization of the enzymatic nucleic acid molecule to the target RNA and thus permit
cleavage. One hundred percent complementarity is preferred, but complementarity as low as
50-75% can also be useful in this invention (see for example Werner and Uhlenbeck, 1995,
Nucleic Acids Research, 23, 2092-2096; Hammann et al., 1999, Antisense and Nucleic Acid
Drug Dev., 9, 25-31). The nucleic acids can be modified at the base, sugar, and/or phosphate
groups. The term DNAzyme-based enzymatic nucleic acid is used interchangeably with
phrases such as catalytic DNA, aptazyme or aptamer-binding DNAzyme, regulatable
DNAzyme, catalytic oligonucleotides, nucleozyme, DNAzyme, endoribonuclease,
endonuclease, minizyme, leadzyme, oligozyme or DNA enzyme. All of these terminologies
describe nucleic acid molecules with enzymatic activity. The specific enzymatic nucleic acid
molecules described in the instant application are not limiting in the invention and those
skilled in the art will recognize that all that is important in an enzymatic nucleic acid
molecule of this invention is that it have a specific substrate binding site which is
complementary to one or more of the target nucleic acid regions, and that it have nucleotide
sequences within or surrounding that substrate binding site which impart a nucleic acid
cleaving and/or ligation activity to the molecule.

By "nucleic acid molecule" as used herein is meant a molecule having nucleotides. The
nucleic acid can be single, double, or multiple stranded and can comprise modified or 
unmodified nucleotides or non-nucleotides or various mixtures and combinations thereof. 

By "enzymatic portion" or "catalytic domain" is meant that portion/region of the
enzymatic nucleic acid molecule essential for cleavage of a nucleic acid substrate (for 
example see Figures 1-4). 

By "substrate binding arm" or "substrate binding domain" is meant that portion/region 
of an enzymatic nucleic acid which is able to interact, for example via complementarity (i.e.,
able to base-pair with), with a portion of its substrate. Preferably, such complementarity is
100%, but can be less if desired. For example, as few as 10 bases out of 14 can be base-paired
(see for example Werner and Uhlenbeck, 1995, Nucleic Acids Research, 23, 2092-2096;
Hammann et al., 1999, Antisense and Nucleic Acid Drug Dev., 9, 25-31). Examples of such
arms are shown generally in Figures 1-3. That is, these arms contain sequences within an
enzymatic nucleic acid which are intended to bring enzymatic nucleic acid and target RNA
together through complementary base-pairing interactions. The enzymatic nucleic acid of the
invention can have binding arms that are contiguous or non-contiguous and can be of varying
lengths. The length of the binding arm(s) is preferably greater than or equal to four
nucleotides and of sufficient length to stably interact with the target RNA; preferably 12-100
nucleotides; more preferably 14-24 nucleotides long (see for example Werner and Uhlenbeck,
supra; Hammann et al., supra; Hampel et al., EP0360257; Berzal-Herranz et al., 1993, EMBO
J., 12, 2567-73). If two binding arms are chosen, the design is such that the lengths of the
binding arms are symmetrical (i.e., each of the binding arms is of the same length; e.g., five
and five nucleotides, or six and six nucleotides, or seven and seven nucleotides long) or 
asymmetrical (i.e., the binding arms are of different length; e.g., six and three nucleotides; 
three and six nucleotides long; four and five nucleotides long; four and six nucleotides long; 
four and seven nucleotides long; and the like). 
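
The choice between symmetrical and asymmetrical binding arms can be illustrated with a short Python sketch. It is a simplification under stated assumptions: the arms are taken as the exact reverse complements of the target sequence flanking a chosen cleavage position, and any nucleotide that a particular motif leaves unpaired at the cleavage site is ignored. The names design_arms and BindingArms and the toy target sequence are hypothetical.

```python
from typing import NamedTuple

_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna: str) -> str:
    """Reverse complement of an RNA string written 5' to 3'."""
    return rna.upper().translate(_COMPLEMENT)[::-1]


class BindingArms(NamedTuple):
    pairs_with_upstream: str    # arm that base-pairs 5' of the cleavage site
    pairs_with_downstream: str  # arm that base-pairs 3' of the cleavage site
    symmetric: bool             # True when both arms have the same length


def design_arms(target: str, cut: int, upstream_len: int, downstream_len: int) -> BindingArms:
    """Build two arms complementary to the target on either side of `cut`,
    where `cut` is the number of target nucleotides 5' of the cleavage site."""
    if cut - upstream_len < 0 or cut + downstream_len > len(target):
        raise ValueError("arms extend past the ends of the target")
    upstream = target[cut - upstream_len:cut]
    downstream = target[cut:cut + downstream_len]
    return BindingArms(
        reverse_complement(upstream),
        reverse_complement(downstream),
        upstream_len == downstream_len,
    )


target = "GGAUCCGUCAUGGCAAUC"  # invented sequence for illustration
print(design_arms(target, 9, 5, 5))  # five and five: symmetrical
print(design_arms(target, 9, 6, 3))  # six and three: asymmetrical
```

Longer arms bind more stably, while shorter arms release the cleaved product more readily, so the two lengths are design parameters to be tuned rather than fixed values.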

By "Inozyme" or "NCH" motif or configuration is meant, an enzymatic nucleic acid 
molecule comprising a motif as is generally described as NCH Rz in Figure 1 and in Ludwig 
et al, International PCT Publication No. WO 98/58058 and US Patent Application Serial No. 
08/878,640. Inozymes possess endonuclease activity to cleave nucleic acid substrates having 
a cleavage triplet NCH/, where N is a nucleotide, C is cytidine and H is adenosine, uridine or 
cytidine, and "/" represents the cleavage site. H is used interchangeably with X. Inozymes 
can also possess endonuclease activity to cleave nucleic acid substrates having a cleavage 
triplet NCN/, where N is a nucleotide, C is cytidine, and "/" represents the cleavage site. "I"
in Figure 1 represents an Inosine nucleotide, preferably a ribo-Inosine or xylo-Inosine
nucleoside. 

By "G-cleaver" motif or configuration is meant, an enzymatic nucleic acid molecule 
comprising a motif as is generally described as G-cleaver Rz in Figure 1 and in Eckstein et 
al., US 6,127,173. G-cleavers possess endonuclease activity to cleave nucleic acid substrates
having a cleavage triplet NYN/, where N is a nucleotide, Y is uridine or cytidine and "/"
represents the cleavage site. G-cleavers can be chemically modified as is generally shown in
Figure 1. 

By "amberzyme" motif or configuration is meant, an enzymatic nucleic acid molecule 
comprising a motif as is generally described in Figure 2 and in Beigelman et al.,
International PCT publication No. WO 99/55857 and US Patent Application Serial No.
09/476,387. Amberzymes possess endonuclease activity to cleave nucleic acid substrates
having a cleavage triplet NG/N, where N is a nucleotide, G is guanosine, and "/" represents
the cleavage site. Amberzymes can be chemically modified to increase nuclease stability
through substitutions as are generally shown in Figure 2. In addition, differing nucleoside
and/or non-nucleoside linkers can be used to substitute the 5'-gaaa-3' loops shown in the
figure. Amberzymes represent a non-limiting example of an enzymatic nucleic acid molecule
that does not require a ribonucleotide (2'-OH) group within its own nucleic acid sequence for
activity. 

By "zinzyme" motif or configuration is meant, an enzymatic nucleic acid molecule
comprising a motif as is generally described in Figure 3 and in Beigelman et al., International
PCT publication No. WO 99/55857 and US Patent Application Serial No. 09/918,728.
Zinzymes possess endonuclease activity to cleave nucleic acid substrates having a cleavage
triplet including but not limited to YG/Y, where Y is uridine or cytidine, G is guanosine,
and "/" represents the cleavage site. Zinzymes can be chemically modified to increase
nuclease stability through substitutions as are generally shown in Figure 3, including
substituting 2'-O-methyl guanosine nucleotides for guanosine nucleotides. In addition,
differing nucleotide and/or non-nucleotide linkers can be used to substitute the 5'-gaaa-3'
loop shown in the figure. Zinzymes represent a non-limiting example of an enzymatic nucleic
acid molecule that does not require a ribonucleotide (2'-OH) group within its own nucleic
acid sequence for activity.
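
The cleavage triplets defined above for the Inozyme (NCH/), G-cleaver (NYN/), amberzyme (NG/N) and zinzyme (YG/Y) motifs lend themselves to a simple sequence scan. The following Python sketch is illustrative only: the MOTIFS table, the short keys, and the function cleavage_sites are hypothetical names, and the scan reports every triplet match without considering accessibility or the flanking sequence.

```python
import re

# Each entry maps a motif to (lookahead regex for the triplet, offset of the
# cleavage site "/" from the start of the triplet). N = any nucleotide,
# Y = U or C, H = A, U or C, as defined in the text.
MOTIFS = {
    "NCH": (r"(?=[ACGU]C[ACU])", 3),     # Inozyme, NCH/
    "NYN": (r"(?=[ACGU][CU][ACGU])", 3), # G-cleaver, NYN/
    "NGN": (r"(?=[ACGU]G[ACGU])", 2),    # amberzyme, NG/N
    "YGY": (r"(?=[CU]G[CU])", 2),        # zinzyme, YG/Y
}


def cleavage_sites(sequence: str, motif: str) -> list:
    """Return candidate cleavage positions, each expressed as the number of
    nucleotides lying 5' of the cleavage site. DNA input is read as RNA."""
    rna = sequence.upper().replace("T", "U")
    pattern, offset = MOTIFS[motif]
    # The zero-width lookahead lets overlapping triplets all be reported.
    return [m.start() + offset for m in re.finditer(pattern, rna)]


rna = "AUGCUGCA"  # invented sequence for illustration
for name in MOTIFS:
    print(name, cleavage_sites(rna, name))
```

A real target search would additionally filter these positions, for example by restricting them to conserved or accessible regions of the transcript.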

By "DNAzyme" is meant an enzymatic nucleic acid molecule that does not require the
presence of a 2'-OH group within its own nucleic acid sequence for activity. In particular
embodiments the enzymatic nucleic acid molecule can have an attached linker or linkers or
other attached or associated groups, moieties, or chains containing one or more nucleotides
with 2'-OH groups. DNAzymes can be synthesized chemically or expressed endogenously in
vivo, by means of a single stranded DNA vector or equivalent thereof. An example of a
DNAzyme is shown in Figure 4 and is generally reviewed in Usman et al., US Patent No.
6,159,714; Chartrand et al., 1995, NAR 23, 4092; Breaker et al., 1995, Chem. Bio. 2, 655;
Santoro et al., 1997, PNAS 94, 4262; Breaker, 1999, Nature Biotechnology, 17, 422-423; and
Santoro et al., 2000, J. Am. Chem. Soc., 122, 2433-39. The "10-23" DNAzyme motif is one
particular type of DNAzyme that was evolved using in vitro selection, see Santoro et al.,
supra and as generally described in Joyce et al., US 5,807,718. Additional DNAzyme motifs
can be selected by using techniques similar to those described in these references, and hence,
are within the scope of the present invention. DNAzymes of the invention can comprise
nucleotides modified at the nucleic acid base, sugar, or phosphate backbone. Non-limiting
examples of sugar modifications that can be used in DNAzymes of the invention include
2'-O-alkyl modifications such as 2'-O-methyl or 2'-O-allyl, 2'-C-alkyl modifications such as
2'-C-allyl, 2'-deoxy-2'-amino, 2'-halo modifications such as 2'-fluoro, 2'-chloro, or 2'-bromo,
isomeric modifications such as arabinofuranose or xylofuranose based nucleic acids, and
other sugar modifications such as 4'-thio or 4'-carbocyclic nucleic acids. Non-limiting
examples of nucleic acid based modifications that can be used in DNAzymes of the invention
include modified purine heterocycles, G-clamp heterocycles, and various modified pyrimidine
cycles. Non-limiting examples of backbone modifications that can be used in DNAzymes of
the invention include phosphorothioate, phosphorodithioate, phosphoramidate, and
methylphosphonate internucleotide linkages. DNAzymes of the invention can comprise
naturally occurring nucleic acids, chimeras of chemically modified and naturally occurring
nucleic acids, or completely modified nucleic acids.

In general, enzymatic nucleic acids act by first binding to a target RNA. Such binding
occurs through the target binding portion of an enzymatic nucleic acid that is held in close
proximity to an enzymatic portion of the molecule that acts to cleave the target RNA. Thus,
the enzymatic nucleic acid first recognizes and then binds a target RNA through
complementary base-pairing, and once bound to the correct site, acts enzymatically to cut the
target RNA. Strategic cleavage of such a target RNA will destroy its ability to direct
synthesis of an encoded protein. After an enzymatic nucleic acid has bound and cleaved its
RNA target, it is released from that RNA to search for another target and can repeatedly bind
and cleave new targets. Thus, a single enzymatic nucleic acid molecule is able to cleave
many molecules of target RNA. In addition, the enzymatic nucleic acid molecule is a highly
specific inhibitor of gene expression, with the specificity of inhibition depending not only on
the base-pairing mechanism of binding to the target RNA, but also on the mechanism of 
target RNA cleavage. Single mismatches, or base-substitutions, near the site of cleavage can 
completely eliminate catalytic activity of an enzymatic nucleic acid molecule. 

By "sufficient length" is meant an oligonucleotide of greater than or equal to 3
nucleotides that is of a length great enough to provide the intended function under the
expected condition. For example, for binding arms of enzymatic nucleic acid "sufficient
length" means that the binding arm sequence is long enough to provide stable binding to a
target site under the expected binding conditions. Preferably, the binding arms are not so
long as to prevent useful turnover of the nucleic acid molecule.

By "stably interact" is meant interaction of oligonucleotides with target nucleic acid 
molecules (e.g., by forming hydrogen bonds with complementary nucleotides in the target 
under physiological conditions) that is sufficient to the intended purpose (e.g., cleavage of 
target RNA by an enzyme). 

By "equivalent" RNA to Ras is meant to include those naturally occurring RNA
molecules having homology (partial or complete) to Ras nucleic acids or encoding for
proteins with similar function as Ras proteins in various organisms, including humans,
rodents, primates, rabbits, pigs, protozoans, fungi, plants, and other microorganisms and
parasites. The equivalent RNA sequence can also include, in addition to the coding region,
regions such as a 5'-untranslated region, a 3'-untranslated region, introns, an intron-exon
junction and the like. 

By "equivalent" RNA to HIV is meant to include those naturally occurring RNA 
molecules having homology (partial or complete) to HIV nucleic acids or encoding for 
proteins with similar function as HIV proteins in various organisms, including human, rodent, 
primate, rabbit, pig, protozoans, fungi, plants, and other microorganisms and parasites. The
equivalent RNA sequence also includes in addition to the coding region, regions such as 5'-untranslated
region, 3'-untranslated region, introns, intron-exon junction and the like.

By "equivalent" RNA to HER2 is meant to include those naturally occurring RNA 
molecules having homology (partial or complete) to HER2 nucleic acids or encoding for 
proteins with similar function as HER2 proteins in various organisms, including humans,
rodents, primates, rabbits, pigs, protozoans, fungi, plants, and other microorganisms and
parasites. The equivalent RNA sequence also includes, in addition to the coding region,
regions such as a 5'-untranslated region, a 3'-untranslated region, introns, an intron-exon
junction and the like. 

By "homology" is meant the nucleotide sequence of two or more nucleic acid molecules 
is partially or completely identical. 

By "component" of HIV is meant a peptide or protein expressed from an HIV gene, for
example nef, vif, tat, or rev viral gene products.

By "component" of HER2 is meant a peptide or protein subunit expressed from a HER2
gene.

By "component" of Ras is meant a peptide or protein subunit expressed from a Ras
gene.

By "gene" it is meant a nucleic acid that encodes an RNA, for example, nucleic acid 
sequences including but not limited to structural genes encoding a polypeptide. 

"Complementarity" refers to the ability of a nucleic acid to form hydrogen bond or 
bonds with another RNA sequence by either traditional Watson-Crick or other non-traditional
types. In reference to the nucleic acid molecules of the present invention, the binding free energy
for a nucleic acid molecule with its target or complementary sequence is sufficient to allow
the relevant function of the nucleic acid to proceed, e.g., enzymatic nucleic acid cleavage,
antisense or triple helix inhibition. Determination of binding free energies for nucleic acid
molecules is well known in the art (see, e.g., Turner et al., 1987, CSH Symp. Quant. Biol. LII
pp. 123-133; Freier et al., 1986, Proc. Nat. Acad. Sci. USA 83:9373-9377; Turner et al., 1987,
J. Am. Chem. Soc. 109:3783-3785). A percent complementarity indicates the percentage of
contiguous residues in a nucleic acid molecule that can form hydrogen bonds (e.g., Watson-Crick
base pairing) with a second nucleic acid sequence (e.g., 5, 6, 7, 8, 9, 10 out of 10 being
50%, 60%, 70%, 80%, 90%, and 100% complementary). "Perfectly complementary" means
that all the contiguous residues of a nucleic acid sequence will hydrogen bond with the same
number of contiguous residues in a second nucleic acid sequence.
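
The percent complementarity arithmetic described above (for example, 8 paired residues out of 10 being 80%) can be sketched as follows. This is a minimal illustration assuming two RNA strands of equal length, antiparallel pairing, and Watson-Crick pairs only; non-traditional pairs such as G-U wobble, which the definition also permits, are not counted. The function name percent_complementarity and the toy sequences are hypothetical.

```python
WATSON_CRICK = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def percent_complementarity(oligo: str, target: str) -> float:
    """Percentage of residues in `oligo` (5' to 3') that form a Watson-Crick
    pair with `target` (5' to 3') when the strands are aligned antiparallel."""
    if not oligo or len(oligo) != len(target):
        raise ValueError("strands must be non-empty and of equal length")
    # Residue i of the oligo faces residue -(i + 1) of the target.
    paired = sum(
        (a, b) in WATSON_CRICK
        for a, b in zip(oligo.upper(), reversed(target.upper()))
    )
    return 100.0 * paired / len(oligo)


target = "AUGGCUAGCU"   # invented 10-nucleotide site
perfect = "AGCUAGCCAU"  # exact reverse complement of the site
two_off = "UGCUAGCCAA"  # same oligo with both terminal residues mismatched

print(percent_complementarity(perfect, target))
print(percent_complementarity(two_off, target))
```

Under this definition the first oligo is "perfectly complementary" to the site, while the second pairs at 8 of 10 positions.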

By "RNA" is meant a molecule comprising at least one ribonucleotide residue. By
"ribonucleotide" or "2'-OH" is meant a nucleotide with a hydroxyl group at the 2' position of
a β-D-ribo-furanose moiety.

By "decoy" is meant a nucleic acid molecule, for example RNA or DNA, or aptamer
that is designed to preferentially bind to a predetermined ligand. Such binding can result in
the inhibition or activation of a target molecule. A decoy or aptamer can compete with a
naturally occurring binding target for the binding of a specific ligand. For example, it has
been shown that over-expression of HIV trans-activation response (TAR) RNA can act as a
"decoy" and efficiently binds HIV tat protein, thereby preventing it from binding to TAR
sequences encoded in the HIV RNA (Sullenger et al., 1990, Cell, 63, 601-608). This is but a
specific example and those in the art will recognize that other embodiments can be readily
generated using techniques generally known in the art, see for example Gold et al., 1995,
Annu. Rev. Biochem., 64, 763; Brody and Gold, 2000, J. Biotechnol., 74, 5; Sun, 2000, Curr.
Opin. Mol. Ther., 2, 100; Kusser, 2000, J. Biotechnol., 74, 27; Hermann and Patel, 2000,
Science, 287, 820; and Jayasena, 1999, Clinical Chemistry, 45, 1628. Similarly, a decoy can
be designed to bind to Ras and block the binding of Ras or a decoy can be designed to bind to 
Ras and prevent interaction with the Ras protein. 

By "aptamer" or "nucleic acid aptamer" as used herein is meant a nucleic acid molecule 
that binds specifically to a target molecule wherein the nucleic acid molecule has sequence 
that is distinct from sequence recognized by the target molecule in its natural setting. 
Alternately, an aptamer can be a nucleic acid molecule that binds to a target molecule where 
the target molecule does not naturally bind to a nucleic acid. The target molecule can be any 
molecule of interest. For example, the aptamer can be used to bind to a ligand binding domain 
of a protein, thereby preventing interaction of the naturally occurring ligand with the protein. 
Similarly, the nucleic acid molecules of the instant invention can bind to RAS, Her-2 or HIV
encoded RNA or protein receptors to block the activity of the target protein or nucleic
acid. This is a non-limiting example and those in the art will recognize that other
embodiments can be readily generated using techniques generally known in the art, see for
example Gold et al., US 5,475,096 and 5,270,163; Gold et al., 1995, Annu. Rev. Biochem.,
64, 763; Brody and Gold, 2000, J. Biotechnol., 74, 5; Sun, 2000, Curr. Opin. Mol. Ther., 2,
100; Kusser, 2000, J. Biotechnol., 74, 27; Hermann and Patel, 2000, Science, 287, 820; and
Jayasena, 1999, Clinical Chemistry, 45, 1628. 

The term "short interfering RNA" or "siRNA" as used herein refers to a double
stranded nucleic acid molecule capable of RNA interference ("RNAi"), see for example Bass,
2001, Nature, 411, 428-429; Elbashir et al., 2001, Nature, 411, 494-498; and Kreutzer et al.,
International PCT Publication No. WO 00/44895; Zernicka-Goetz et al., International PCT
Publication No. WO 01/36646; Fire, International PCT Publication No. WO 99/32619;
Plaetinck et al., International PCT Publication No. WO 00/01846; Mello and Fire,
International PCT Publication No. WO 01/29058; Deschamps-Depaillette, International PCT
Publication No. WO 99/07409; and Li et al., International PCT Publication No. WO
00/44914. As used herein, siRNA molecules need not be limited to those molecules
containing only RNA, but further encompass chemically modified nucleotides and
non-nucleotides.

Nucleic acid molecules that modulate expression of Ras-specific RNAs represent a 
therapeutic approach to treat cancer, including, but not limited to colorectal cancer, bladder
cancer, lung cancer, pancreatic cancer, breast cancer, or prostate cancer and any other cancer, 
disease or condition that responds to the modulation of Ras expression. 

Nucleic acid molecules that modulate expression of HIV-specific RNAs also represent
a therapeutic approach to treat acquired immunodeficiency syndrome (AIDS) and/or any other
disease, condition, or syndrome that responds to the modulation of HIV expression.

Nucleic acid molecules that modulate expression of HER2-specific RNAs represent a 
therapeutic approach to treat cancer, including, but not limited to breast and ovarian cancer 
and any other cancer, disease or condition that responds to the modulation of HER2 
expression. 

In one embodiment of the inventions described herein, the enzymatic nucleic acid
molecule is formed in a hammerhead or hairpin motif, but can also be formed in the motif of
a hepatitis delta virus, group I intron, group II intron or RNase P RNA (in association with an
RNA guide sequence), Neurospora VS RNA, DNAzymes, NCH cleaving motifs, or G-cleavers.
Examples of such hammerhead motifs are described by Dreyfus, supra, Rossi et al.,
1992, AIDS Research and Human Retroviruses 8, 183; of hairpin motifs by Hampel et al.,
EP0360257, Hampel and Tritz, 1989 Biochemistry 28, 4929, Feldstein et al., 1989, Gene 82,
53, Haseloff and Gerlach, 1989, Gene, 82, 43, and Hampel et al., 1990 Nucleic Acids Res. 18,
299; Chowrira & McSwiggen, U.S. Patent No. 5,631,359; of the hepatitis delta virus motif by
Perrotta and Been, 1992 Biochemistry 31, 16; of the RNase P motif by Guerrier-Takada
et al., 1983 Cell 35, 849; Forster and Altman, 1990, Science 249, 783; Li and Altman,
1996, Nucleic Acids Res. 24, 835; the Neurospora VS RNA ribozyme motif is described by
Collins (Saville and Collins, 1990 Cell 61, 685-696; Saville and Collins, 1991 Proc. Natl.
Acad. Sci. USA 88, 8826-8830; Collins and Olive, 1993 Biochemistry 32, 2795-2799; Guo
and Collins, 1995, EMBO J. 14, 363); Group II introns are described by Griffin et al., 1995,
Chem. Biol. 2, 761; Michels and Pyle, 1995, Biochemistry 34, 2965; Pyle et al., International
PCT Publication No. WO 96/22689; of the Group I intron by Cech et al., U.S. Patent
4,987,071 and of DNAzymes by Usman et al., International PCT Publication No. WO
95/11304; Chartrand et al., 1995, NAR 23, 4092; Breaker et al., 1995, Chem. Bio. 2, 655;
Santoro et al., 1997, PNAS 94, 4262, and Beigelman et al., International PCT publication No.
WO 99/55857. NCH cleaving motifs are described in Ludwig & Sproat, International PCT
Publication No. WO 98/58058; and G-cleavers are described in Kore et al., 1998, Nucleic
Acids Research 26, 4116-4120 and Eckstein et al., International PCT Publication No. WO
99/16871. Additional motifs such as the Aptazyme (Breaker et al., WO 98/43993),
Amberzyme (Class I motif; Figure 2; Beigelman et al., U.S. Serial No. 09/301,511) and
Zinzyme (Figure 3) (Beigelman et al., U.S. Serial No. 09/301,511), all included by reference
herein including drawings, can also be used in the present invention. These specific motifs or
configurations are not limiting in the invention and those skilled in the art will recognize that
all that is important in an enzymatic nucleic acid molecule of this invention is that it has a
specific substrate binding site which is complementary to one or more of the target gene RNA
regions, and that it has nucleotide sequences within or surrounding that substrate binding
site which impart an RNA cleaving activity to the molecule (Cech et al., U.S. Patent No.
4,987,071).

In one embodiment of the present invention, a nucleic acid molecule of the instant
invention can be between about 10 and 100 nucleotides in length. Exemplary enzymatic
nucleic acid molecules of the invention are shown in the Tables herein. For example,
enzymatic nucleic acid molecules of the invention are preferably between about 15 and 50
nucleotides in length, more preferably between about 25 and 40 nucleotides in length, e.g.,
34, 36, or 38 nucleotides in length (for example see Jarvis et al., 1996, J. Biol. Chem., 271,
29107-29112). Exemplary DNAzymes of the invention are preferably between about 15 and
40 nucleotides in length, more preferably between about 25 and 35 nucleotides in length, e.g.,
29, 30, 31, or 32 nucleotides in length (see for example Santoro et al., 1998, Biochemistry,
37, 13330-13342; Chartrand et al., 1995, Nucleic Acids Research, 23, 4092-4096).
Exemplary antisense molecules of the invention are preferably between about 15 and 75
nucleotides in length, more preferably between about 20 and 35 nucleotides in length, e.g.,
25, 26, 27, or 28 nucleotides in length (see for example Woolf et al., 1992, PNAS, 89,
7305-7309; Milner et al., 1997, Nature Biotechnology, 15, 537-541). Exemplary triplex forming
oligonucleotide molecules of the invention are preferably between about 10 and 40
nucleotides in length, more preferably between about 12 and 25 nucleotides in length, e.g.,
18, 19, 20, or 21 nucleotides in length (see for example Maher et al., 1990, Biochemistry, 29,
8820-8826; Strobel and Dervan, 1990, Science, 249, 73-75). Those skilled in the art will
recognize that all that is required is for a nucleic acid molecule to be of length and
conformation sufficient and suitable for the nucleic acid molecule to interact with its target
and/or catalyze a reaction contemplated herein. The length of the nucleic acid molecules of
the instant invention is not limiting within the general limits stated.

Preferably, a nucleic acid molecule that modulates, for example, down-regulates Ras,
HIV, and/or HER2 expression and/or activity, comprises between 12 and 100 bases
complementary to an RNA molecule of Ras, HIV, and/or HER2 respectively. Even more
preferably, a nucleic acid molecule that modulates Ras, HIV, and/or HER2 expression
comprises between 14 and 24 bases complementary to an RNA molecule of Ras, HIV, and/or
HER2 respectively.
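
The length ranges recited above can be collected into a small lookup for screening candidate sequences. The following is a minimal sketch only and forms no part of the disclosed methods: the names PREFERRED_LENGTHS and length_class are hypothetical, and bounds that the text states as "about" are treated as exact, inclusive integers purely for illustration.

```python
# Illustrative lookup of the length ranges recited in the text.
# Names are hypothetical; "about" bounds are treated as exact and inclusive.
PREFERRED_LENGTHS = {
    # molecule type: (preferred range, more preferred range), in nucleotides
    "enzymatic": ((15, 50), (25, 40)),
    "dnazyme": ((15, 40), (25, 35)),
    "antisense": ((15, 75), (20, 35)),
    "triplex": ((10, 40), (12, 25)),
}


def length_class(kind: str, sequence: str) -> str:
    """Classify a sequence length against the ranges for its molecule type."""
    (low, high), (best_low, best_high) = PREFERRED_LENGTHS[kind]
    n = len(sequence)
    if best_low <= n <= best_high:
        return "more preferred"
    if low <= n <= high:
        return "preferred"
    return "outside preferred range"
```

A result of "outside preferred range" does not exclude a molecule, since the text states that length is not limiting within the general limits given.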

The invention provides a method for producing a class of nucleic acid-based gene
modulating agents that exhibit a high degree of specificity for RNA of a desired target. For
example, an enzymatic nucleic acid molecule is preferably targeted to a highly conserved
sequence region of target RNAs encoding Ras (and specifically a Ras gene) such that specific
treatment of a disease or condition can be provided with either one or several nucleic acid
molecules of the invention. Such nucleic acid molecules can be delivered exogenously to
specific tissue or cellular targets as required. Alternatively, the nucleic acid molecules (e.g.,
enzymatic nucleic acid molecules, siRNA, antisense, and/or DNAzymes) can be expressed
from DNA and/or RNA vectors that are delivered to specific cells.
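
Targeting a highly conserved sequence region, as described above, presupposes that such regions can be located. One simple way this might be done is sketched below, under stated assumptions: the input sequences are already aligned in the same register, "conserved" is taken to mean exact identity across all sequences (the text requires only that the region not vary significantly), and the function name conserved_windows and the window size are hypothetical.

```python
def conserved_windows(aligned, window=15):
    """Return (start, subsequence) for every window identical in all sequences.

    `aligned` is a list of pre-aligned sequences; positions are 0-based.
    Exact identity is an illustrative simplification of "highly conserved".
    """
    if not aligned:
        return []
    usable = min(len(s) for s in aligned)  # ignore any ragged tail
    hits = []
    for start in range(usable - window + 1):
        reference = aligned[0][start:start + window]
        if all(s[start:start + window] == reference for s in aligned[1:]):
            hits.append((start, reference))
    return hits
```

For instance, with three toy variants (not taken from the Tables) that differ only at positions 9 and 10, a window of 9 yields a single conserved window starting at position 0.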

As used herein "cell" is used in its usual biological sense, and does not refer to an entire 
multicellular organism. A cell can, for example, be in vitro, e.g., in cell culture, or present in 
a multicellular organism, including, e.g., birds, plants and mammals such as humans, cows, 
sheep, apes, monkeys, swine, dogs, and cats. The cell can be prokaryotic (e.g., bacterial cell) 
or eukaryotic (e.g., mammalian or plant cell).

By "Ras proteins" is meant, a peptide or protein comprising Ras tyrosine kinase-type 
cell surface receptor or a peptide or protein encoded by a Ras gene, such as K-Ras, H-Ras, or 
N-Ras.

By "HIV proteins" is meant, a peptide or protein comprising a component of HIV or a
peptide or protein encoded by an HIV gene.

By "HER2 proteins" is meant, a peptide or protein comprising HER2/ERB2/NEU
tyrosine kinase-type cell surface receptor or a peptide or protein encoded by a
HER2/ERB2/NEU gene.

By "highly conserved sequence region" is meant, a nucleotide sequence of one or more 
regions in a target gene that does not vary significantly from one generation to the other or
from one biological system to the other.

Nucleic acid-based modulators, including inhibitors, of Ras expression are useful for
the prevention and/or treatment of cancer, including but not limited to breast cancer and
ovarian cancer and any other disease or condition that responds to the modulation of Ras
expression.

Nucleic acid-based inhibitors of HIV expression are useful for the prevention and/or 
treatment of acquired immunodeficiency disease (AIDS) and related diseases and conditions, 
including but not limited to Kaposi's sarcoma, lymphoma, cervical cancer, squamous cell 
carcinoma, cardiac myopathy, rheumatic diseases, and opportunistic infection, for example
Pneumocystis carinii, Cytomegalovirus, Herpes simplex, Mycobacteria, Cryptococcus,
Toxoplasma, Progressive multifocal leukoencephalopathy (Papovavirus),
Aspergillus, Candida, Cryptosporidium, Isospora belli, Microsporidia and any
other disease or condition that responds to the modulation of HIV expression.

Nucleic acid-based inhibitors of HER2 expression are useful for the prevention and/or 
treatment of cancer, including but not limited to breast cancer and ovarian cancer and any 
other disease or condition that responds to the modulation of HER2 expression.

By "related" is meant that the reduction of RAS, HIV, or HER2 expression (specifically, of
the RNA levels of the RAS, HIV, or HER2 genes respectively), and thus reduction in the level
of the respective protein, relieves, to some extent, the symptoms of the disease or condition.

The nucleic acid-based molecules of the invention can be added directly, or can be 
complexed with cationic lipids, packaged within liposomes, or otherwise delivered to target 
cells or tissues. The nucleic acid or nucleic acid complexes can be locally administered to 
relevant tissues ex vivo, or in vivo through injection or infusion pump, with or without their 
incorporation in biopolymers. In certain embodiments, the enzymatic nucleic acid molecules 
comprise sequences that are complementary to the substrate sequences in the Tables herein. 
Examples of such enzymatic nucleic acid molecules also are shown in the Tables herein. 
Examples of such enzymatic nucleic acid molecules consist essentially of sequences defined 
in these tables. 

In another embodiment, the invention features siRNA, antisense nucleic acid molecules
and 2-5A chimeras comprising sequences complementary to the substrate sequences shown in
the Tables herein. Such nucleic acid molecules can comprise sequences as shown for the
binding arms of the enzymatic nucleic acid molecules in the Tables. Similarly, triplex
molecules can be targeted to corresponding DNA target regions; such molecules can comprise
the DNA equivalent of a target sequence or a sequence complementary to the specified target
(substrate) sequence. Typically, antisense molecules are complementary to a target sequence
along a single contiguous sequence of the antisense molecule. However, in certain
embodiments, an antisense molecule can bind to a substrate such that the substrate molecule
forms a loop, and/or an antisense molecule can bind such that the antisense molecule forms a
loop. Thus, the antisense molecule can be complementary to two or more non-contiguous
substrate sequences. In addition, two or more non-contiguous sequence portions of an
antisense molecule can be complementary to a target sequence.

By "consists essentially of" is meant that the active nucleic acid molecule of the
invention, for example, an enzymatic nucleic acid molecule, contains an enzymatic center or
core equivalent to those in the examples, and binding arms able to bind RNA such that
cleavage at the target site occurs. Other sequences can be present that do not interfere with
such cleavage. Thus, a core region of an enzymatic nucleic acid molecule can, for example,
include one or more loops, stem-loop structures, or linkers that do not prevent enzymatic
activity. Thus, various regions in the sequences in the Tables can be such a loop, stem-loop,
nucleotide linker, and/or non-nucleotide linker and can be represented generally as sequence
"X". The nucleic acid molecules of the instant invention, such as Hammerhead, Inozyme,
G-cleaver, amberzyme, zinzyme, DNAzyme, antisense, 2-5A antisense, triplex forming nucleic
acid, and decoy nucleic acids, can contain other sequences or non-nucleotide linkers that do
not interfere with the function of the nucleic acid molecule.

Sequence X can be a linker of ≥ 2 nucleotides in length, preferably 3, 4, 5, 6, 7, 8, 9, 10,
15, 20, 26, 30, where the nucleotides can preferably be internally base-paired to form a stem
of preferably ≥ 2 base pairs. Alternatively or in addition, sequence X can be a non-nucleotide
linker. In yet another embodiment, the nucleotide linker X can be a nucleic acid aptamer, such
as an ATP aptamer, HIV Rev aptamer (RRE), HIV Tat aptamer (TAR) and others (for a review
see Gold et al., 1995, Annu. Rev. Biochem., 64, 763; and Szostak & Ellington, 1993, in The
RNA World, ed. Gesteland and Atkins, pp. 511, CSH Laboratory Press). A "nucleic acid
aptamer" as used herein is meant to indicate a nucleic acid sequence capable of interacting
with a ligand. The ligand can be any natural or synthetic molecule, including but not limited
to a resin, metabolites, nucleosides, nucleotides, drugs, toxins, transition state analogs,
peptides, lipids, proteins, amino acids, nucleic acid molecules, hormones, carbohydrates,
receptors, cells, viruses, bacteria and others.
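
Whether a candidate nucleotide linker X can be internally base-paired into a stem, as contemplated above, can be checked crudely by pairing its two ends inward. The sketch below is illustrative only and is not a folding algorithm: it considers Watson-Crick pairs alone (no G-U wobble), the minimum loop size of three unpaired nucleotides is an assumption, and the name terminal_stem_length is hypothetical. The function reports the stem length and leaves the comparison against any required minimum to the caller.

```python
# Watson-Crick RNA pairs only; wobble pairs are deliberately ignored here.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G")}


def terminal_stem_length(linker: str, min_loop: int = 3) -> int:
    """Count consecutive base pairs formed between the 5' and 3' ends.

    Pairing proceeds inward and stops at the first mismatch, or when fewer
    than `min_loop` nucleotides would remain between the paired positions.
    """
    seq = linker.upper()
    i, j = 0, len(seq) - 1
    stem = 0
    while j - i - 1 >= min_loop and (seq[i], seq[j]) in PAIRS:
        stem += 1
        i += 1
        j -= 1
    return stem
```

For the toy linker GGCCGAAAGGCC this returns 4, corresponding to a four base pair stem closed by a GAAA loop.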

In yet another embodiment, a non-nucleotide linker X is as defined herein. Non-nucleotides
can include abasic nucleotide, polyether, polyamine, polyamide, peptide,
carbohydrate, lipid, or polyhydrocarbon compounds. Specific examples include those
described by Seela and Kaiser, Nucleic Acids Res. 1990, 18:6353 and Nucleic Acids Res.
1987, 15:3113; Cload and Schepartz, J. Am. Chem. Soc. 1991, 113:6324; Richardson and
Schepartz, J. Am. Chem. Soc. 1991, 113:5109; Ma et al., Nucleic Acids Res. 1993, 21:2585
and Biochemistry 1993, 32:1751; Durand et al., Nucleic Acids Res. 1990, 18:6353; McCurdy
et al., Nucleosides & Nucleotides 1991, 10:287; Jaschke et al., Tetrahedron Lett. 1993,
34:301; Ono et al., Biochemistry 1991, 30:9914; Arnold et al., International Publication No.
WO 89/02439; Usman et al., International Publication No. WO 95/06731; Dudycz et al.,
International Publication No. WO 95/11910 and Ferentz and Verdine, J. Am. Chem. Soc.
1991, 113:4000, all hereby incorporated by reference herein. A "non-nucleotide" further
means any group or compound that can be incorporated into a nucleic acid chain in the place
of one or more nucleotide units, including either sugar and/or phosphate substitutions, and
allows the remaining bases to exhibit their enzymatic activity. The group or compound can
be abasic in that it does not contain a commonly recognized nucleotide base, such as
adenosine, guanine, cytosine, uracil or thymine. Thus, in a preferred embodiment, the
invention features an enzymatic nucleic acid molecule having one or more non-nucleotide
moieties, and having enzymatic activity to cleave an RNA or DNA molecule.

In another aspect of the invention, enzymatic nucleic acid molecules, siRNA molecules
or antisense molecules that interact with target RNA molecules and modulate gene expression
activity are expressed from transcription units inserted into DNA or RNA vectors. The
recombinant vectors are preferably DNA plasmids or viral vectors. Enzymatic nucleic acid
molecule or antisense expressing viral vectors can be constructed based on, but not limited to,
adeno-associated virus, retrovirus, adenovirus, or alphavirus as well as others known in the
art. Preferably, recombinant vectors capable of expressing enzymatic nucleic acid molecules
or antisense are delivered as described below, and persist in target cells. Alternatively, viral
vectors can be used that provide for transient expression of enzymatic nucleic acid molecules
or antisense. Such vectors can be repeatedly administered as necessary. Once expressed, the
enzymatic nucleic acid molecules or antisense bind to target RNA and modulate its function
or expression. Delivery of enzymatic nucleic acid molecule or antisense expressing vectors
can be systemic, such as by intravenous or intramuscular administration, by administration to
target cells ex-planted from the patient followed by reintroduction into the patient, or by any
other means that allows for introduction into a desired target cell. Antisense DNA and
DNAzymes can be expressed via the use of a single stranded DNA intracellular expression
vector.

By "vectors" is meant any nucleic acid- and/or viral-based technique used to deliver a
desired nucleic acid.

By "subject" or "patient" is meant an organism that is a donor or recipient of explanted
cells or the cells of the organism. "Subject" or "patient" also refers to an organism to which
the nucleic acid molecules of the invention can be administered. Preferably, a subject or
patient is a mammal or mammalian cells. More preferably, a subject or patient is a human or
human cells.

By "enhanced enzymatic activity" is meant to include activity measured in cells and/or 
in vivo where the activity is a reflection of both the catalytic activity and the stability of the 
nucleic acid molecules of the invention. In this invention, the product of these properties can 
be increased in vivo compared to an all RNA enzymatic nucleic acid or all DNA enzyme, for
example, with a nucleic acid molecule comprising chemical modifications. In some cases, the
activity or stability of the nucleic acid molecule can be decreased (i.e., less than ten-fold), but
the overall activity of the nucleic acid molecule is enhanced, in vivo.

Nucleic acid molecules of the instant invention, individually, or in combination or in 
conjunction with other drugs, can be used to treat diseases or conditions discussed above. For 
example, to treat a disease or condition associated with the levels of Ras, HIV, or HER2, a
subject can be treated, or other appropriate cells can be treated, as is evident to those skilled 
in the art, individually or in combination with one or more drugs under conditions suitable for 
the treatment. 

In a further embodiment, the described molecules, such as antisense, siRNA, or
enzymatic nucleic acid molecules, can be used in combination with other known treatments to
treat conditions or diseases discussed above. For example, the described molecules can be
used in combination with one or more known therapeutic agents to treat cancer, for example
colorectal cancer, bladder cancer, lung cancer, pancreatic cancer, breast cancer, or prostate
cancer, and any other disease or condition that responds to the modulation of Ras expression.

In another embodiment, the invention features nucleic acid-based inhibitors (e.g.,
enzymatic nucleic acid molecules, including DNAzymes, and siRNA) and methods for their use
to down regulate or inhibit the expression of genes (e.g., Ras genes) capable of progression
and/or maintenance of cancer and/or other disease states that respond to the modulation of
Ras expression.

In a further embodiment, the described molecules, such as antisense, siRNA, or
enzymatic nucleic acids, can be used in combination with other known treatments to treat
conditions or diseases discussed above. For example, the described molecules can be used in
combination with one or more known therapeutic agents to treat acquired immunodeficiency
disease (AIDS) and related diseases and conditions, including but not limited to Kaposi's
sarcoma, lymphoma, cervical cancer, squamous cell carcinoma, cardiac myopathy, rheumatic
diseases, and opportunistic infection, for example Pneumocystis carinii, Cytomegalovirus,
Herpes simplex, Mycobacteria, Cryptococcus, Toxoplasma, Progressive multifocal
leukoencephalopathy (Papovavirus), Aspergillus, Candida,
Cryptosporidium, Isospora belli, Microsporidia and any other disease or condition that
responds to the modulation of HIV expression.

Nucleic acid molecules of the instant invention, individually, or in combination or in 
conjunction with other drugs, can be used to treat diseases or conditions discussed above. For 
example, to treat a disease or condition associated with the levels of HER2, a patient can be
treated, or other appropriate cells can be treated, as is evident to those skilled in the art, 
individually or in combination with one or more drugs under conditions suitable for the 
treatment. 

In a further embodiment, the described molecules, such as antisense, siRNA or
enzymatic nucleic acid molecules, can be used in combination with other known treatments to
treat conditions or diseases discussed above. For example, the described molecules can be
used in combination with one or more known therapeutic agents to treat cancer, for example
ovarian cancer and/or breast cancer, and any other disease or condition that responds to the
modulation of HER2 expression.

In another embodiment, the invention features nucleic acid-based inhibitors (e.g.,
enzymatic nucleic acid molecules (including ribozymes), antisense nucleic acids, 2-5A
antisense chimeras, triplex DNA, antisense nucleic acids containing RNA cleaving chemical
groups, and siRNA) and methods for their use to down regulate or inhibit the expression of
genes (e.g., HER2 genes) capable of progression and/or maintenance of cancer and/or other
disease states that respond to the modulation of HER2 expression.

By "comprising" is meant including, but not limited to, whatever follows the word 
"comprising". Thus, use of the term "comprising" indicates that the listed elements are 
required or mandatory, but that other elements are optional and may or may not be present.
By "consisting of" is meant including, and limited to, whatever follows the phrase "consisting
of".

Other features and advantages of the invention will be apparent from the following
description of the preferred embodiments thereof, and from the claims.

Mechanism of action of Nucleic Acid Molecules of the Invention as is Known in the Art

Antisense: Antisense molecules can be modified or unmodified RNA, DNA, or mixed
polymer oligonucleotides and primarily function by specifically binding to matching 
sequences resulting in inhibition of peptide synthesis (Wu-Pong, Nov 1994, BioPharm, 20- 
33). The antisense oligonucleotide binds to target RNA by Watson Crick base-pairing and 
blocks gene expression by preventing ribosomal translation of the bound sequences either by 
steric blocking or by activating RNase H enzyme. Antisense molecules can also alter protein 
synthesis by interfering with RNA processing or transport from the nucleus into the 
cytoplasm (Mukhopadhyay & Roth, 1996, Crit. Rev. in Oncogenesis 7, 151-190). 
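
Because an antisense oligonucleotide binds its target by Watson-Crick base-pairing, its sequence follows directly from the target as the reverse complement. The following minimal sketch shows this for a DNA antisense directed against an RNA target; the function name antisense_dna and the example target are hypothetical and are not drawn from the Tables.

```python
# Base-pairing partners for a DNA strand bound to an RNA target.
DNA_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def antisense_dna(target_rna: str) -> str:
    """Return the DNA antisense (written 5'->3') to an RNA target (5'->3')."""
    # Reverse the target so the result reads 5'->3', then complement each base.
    return "".join(DNA_COMPLEMENT[base] for base in reversed(target_rna.upper()))
```

Chemical modifications, secondary structure and RNase H substrate domains discussed in this section are outside the scope of this sketch, which derives the base sequence only.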

In addition, binding of single stranded DNA to RNA can result in nuclease degradation 
of the heteroduplex (Wu-Pong, supra; Crooke, supra). Backbone-modified DNA chemistries
that have thus far been shown to act as substrates for RNase H are phosphorothioates,
phosphorodithioates, and borontrifluoridates. In addition, 2'-arabino and 2'-fluoro
arabino-containing oligos can also activate RNase H activity.

A number of antisense molecules have been described that utilize novel configurations 
of chemically modified nucleotides, secondary structure, and/or RNase H substrate domains 
(Woolf et al., International PCT Publication No. WO 98/13526; Thompson et al.,
International PCT Publication No. WO 99/54459; Hartmann et al., USSN 60/101,174, filed
on September 21, 1998). All of these references are incorporated by reference herein in their
entirety. 

In addition, antisense deoxyoligoribonucleotides can be used to target RNA by means 
of DNA-RNA interactions, thereby activating RNase H, which digests the target RNA in the 
duplex. Antisense DNA can be expressed via the use of a single stranded DNA intracellular 
expression vector or equivalents and variations thereof. 

RNA interference: RNA interference refers to the process of sequence specific post
transcriptional gene silencing in animals mediated by short interfering RNAs (siRNA) (Fire et
al., 1998, Nature, 391, 806). The corresponding process in plants is commonly referred to as
post transcriptional gene silencing or RNA silencing and is also referred to as quelling in
fungi. The process of post transcriptional gene silencing is thought to be an evolutionarily
conserved cellular defense mechanism used to prevent the expression of foreign genes which
is commonly shared by diverse flora and phyla (Fire et al., 1999, Trends Genet., 15, 358).
Such protection from foreign gene expression may have evolved in response to the production
of double stranded RNAs (dsRNA) derived from viral infection or the random integration of
transposon elements into a host genome via a cellular response that specifically destroys
homologous single stranded RNA or viral genomic RNA. The presence of dsRNA in cells
triggers the RNAi response through a mechanism that has yet to be fully characterized. This
mechanism appears to be different from the interferon response that results from dsRNA
mediated activation of protein kinase PKR and 2',5'-oligoadenylate synthetase resulting in
non-specific cleavage of mRNA by ribonuclease L.

The presence of long dsRNAs in cells stimulates the activity of a ribonuclease III
enzyme referred to as dicer. Dicer is involved in the processing of the dsRNA into short
pieces of dsRNA known as short interfering RNAs (siRNA) (Bernstein et al., 2001, Nature,
409, 363). Short interfering RNAs derived from dicer activity are typically about 21-23
nucleotides in length and comprise about 19 base pair duplexes. Dicer has also been
implicated in the excision of 21 and 22 nucleotide small temporal RNAs (stRNA) from
precursor RNA of conserved structure that are implicated in translational control (Hutvagner
et al., 2001, Science, 293, 834). The RNAi response also features an endonuclease complex
containing a siRNA, commonly referred to as an RNA-induced silencing complex (RISC),
which mediates cleavage of single stranded RNA having sequence homologous to the siRNA.
Cleavage of the target RNA takes place in the middle of the region complementary to the
guide sequence of the siRNA duplex (Elbashir et al., 2001, Genes Dev., 15, 188).
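
The statement that cleavage occurs in the middle of the region complementary to the guide sequence can be illustrated by locating that region in a target and reporting its midpoint. This is a rough sketch under stated assumptions: the guide is taken to be fully complementary to a contiguous stretch of the target, the integer midpoint is used as a stand-in for the actual scissile bond, and the names reverse_complement and approximate_cleavage_index are hypothetical.

```python
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement(rna: str) -> str:
    """Return the reverse complement of an RNA sequence, written 5'->3'."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(rna.upper()))


def approximate_cleavage_index(target: str, guide: str):
    """Return the 0-based midpoint of the guide-complementary region, or None.

    The midpoint is only an approximation of "the middle of the region".
    """
    site = reverse_complement(guide)  # stretch of the target bound by the guide
    start = target.upper().find(site)
    if start < 0:
        return None  # no fully complementary region found
    return start + len(site) // 2
```

Only the first fully complementary region is reported; partial complementarity and multiple sites are not handled.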

Short interfering RNA mediated RNAi has been studied in a variety of systems. Fire et
al., 1998, Nature, 391, 806, were the first to observe RNAi in C. elegans. Wianny and
Goetz, 1999, Nature Cell Biol., 2, 70, describe RNAi mediated by dsRNA in mouse
embryos. Hammond et al., 2000, Nature, 404, 293, describe RNAi in Drosophila cells
transfected with dsRNA. Elbashir et al., 2001, Nature, 411, 494, describe RNAi induced by
introduction of duplexes of synthetic 21-nucleotide RNAs in cultured mammalian cells
including human embryonic kidney and HeLa cells. Recent work in Drosophila embryonic
lysates has revealed certain requirements for siRNA length, structure, chemical composition,
and sequence that are essential to mediate efficient RNAi activity. These studies have shown
that 21 nucleotide siRNA duplexes are most active when containing two nucleotide
3'-overhangs. Furthermore, substitution of one or both siRNA strands with 2'-deoxy or
2'-O-methyl nucleotides abolishes RNAi activity, whereas substitution of 3'-terminal siRNA
nucleotides with deoxy nucleotides was shown to be tolerated. Mismatch sequences in the
center of the siRNA duplex were also shown to abolish RNAi activity. In addition, these
studies also indicate that the position of the cleavage site in the target RNA is defined by the
5'-end of the siRNA guide sequence rather than the 3'-end (Elbashir et al., 2001, EMBO J.,
20, 6877). Other studies have indicated that a 5'-phosphate on the target-complementary
strand of a siRNA duplex is required for siRNA activity and that ATP is utilized to maintain
the 5'-phosphate moiety on the siRNA (Nykanen et al., 2001, Cell, 107, 309); however,
siRNA molecules lacking a 5'-phosphate are active when introduced exogenously, suggesting
that 5'-phosphorylation of siRNA constructs may occur in vivo.
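
The duplex geometry described above, a 21 nucleotide strand pair forming 19 base pairs with two nucleotide 3'-overhangs, can be written out from a 19 nucleotide core sequence. The sketch below is illustrative only: the function name sirna_duplex, the default "UU" overhang and the example core are assumptions and are not taken from the Tables or the cited studies.

```python
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def sirna_duplex(core19: str, overhang: str = "UU"):
    """Build (sense, antisense) 21-nt strands from a 19-nt target core.

    Each strand is the 19-nt paired region followed by a 2-nt 3'-overhang;
    both strands are returned written 5'->3'.
    """
    core = core19.upper()
    if len(core) != 19 or len(overhang) != 2:
        raise ValueError("expected a 19-nt core and a 2-nt overhang")
    paired = "".join(RNA_COMPLEMENT[base] for base in reversed(core))
    sense = core + overhang
    antisense = paired + overhang  # the target-complementary (guide) strand
    return sense, antisense
```

The 3'-overhang can be varied through the overhang parameter, for example to model the deoxy nucleotide substitution at the 3'-terminus that the text reports as tolerated.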

Enzymatic Nucleic Acid: Several varieties of naturally-occurring enzymatic RNAs are
presently known. In addition, several in vitro selection (evolution) strategies (Orgel, 1979,
Proc. R. Soc. London, B 205, 435) have been used to evolve new nucleic acid catalysts
capable of catalyzing cleavage and ligation of phosphodiester linkages (Joyce, 1989, Gene,
82, 83-87; Beaudry et al., 1992, Science 257, 635-641; Joyce, 1992, Scientific American 267,
90-97; Breaker et al., 1994, TIBTECH 12, 268; Bartel et al., 1993, Science 261:1411-1418;
Szostak, 1993, TIBS 17, 89-93; Kumar et al., 1995, FASEB J., 9, 1183; Breaker, 1996, Curr.
Op. Biotech., 7, 442; Santoro et al., 1997, Proc. Natl. Acad. Sci., 94, 4262; Tang et al., 1997,
RNA 3, 914; Nakamaye & Eckstein, 1994, supra; Long & Uhlenbeck, 1994, supra; Ishizaka
et al., 1995, supra; Vaish et al., 1997, Biochemistry 36, 6495; all of these are incorporated by
reference herein). Each can catalyze a series of reactions including the hydrolysis of
phosphodiester bonds in trans (and thus can cleave other RNA molecules) under
physiological conditions.

Nucleic acid molecules of this invention can modulate, e.g., down-regulate, Ras protein 
expression and can be used to treat disease or diagnose disease associated with the levels of 
Ras, HIV and/or HER2. Enzymatic nucleic acid sequences targeting Ras, HIV and/or HER2
RNA and sequences that can be targeted with nucleic acid molecules of the invention to 
down-regulate Ras expression are shown in the Tables herein. 

The enzymatic nature of an enzymatic nucleic acid molecule allows the concentration
of enzymatic nucleic acid molecule necessary to effect a therapeutic treatment to be lower
than that of a nucleic acid molecule lacking enzymatic activity. This reflects the ability of the
enzymatic nucleic acid molecule to act enzymatically. Thus, a single enzymatic nucleic acid
molecule is able to cleave many molecules of target RNA. In addition, the enzymatic nucleic 
acid molecule is a highly specific inhibitor, with the specificity of inhibition depending not 
only on the base-pairing mechanism of binding to the target RNA but also on the mechanism 
of target RNA cleavage. Single mismatches, or base-substitutions, near the site of cleavage 
can be chosen to completely eliminate catalytic activity of an enzymatic nucleic acid molecule.

Nucleic acid molecules having an endonuclease enzymatic activity are able to 
repeatedly cleave other separate RNA molecules in a nucleotide base sequence-specific 
manner. With proper design and construction, such enzymatic nucleic acid molecules can be 
targeted to virtually any RNA transcript, and achieve efficient cleavage in vitro (Zaug et al.,
324, Nature 429 1986; Uhlenbeck, 1987 Nature 328, 596; Kim et al., 84 Proc. Natl. Acad.
Sci. USA 8788, 1987; Dreyfus, 1988, Einstein Quart. J. Bio. Med., 6, 92; Haseloff and
Gerlach, 334 Nature 585, 1988; Cech, 260 JAMA 3030, 1988; and Jefferies et al., 17 Nucleic
Acids Research 1371, 1989; Santoro et al., 1997 supra).

Because of their sequence specificity, trans-cleaving enzymatic nucleic acid molecules
can be used as therapeutic agents for human disease (Usman & McSwiggen, 1995 Ann. Rep.
Med. Chem. 30, 285-294; Christoffersen and Marr, 1995 J. Med. Chem. 38, 2023-2037).
Enzymatic nucleic acid molecules can be designed to cleave specific RNA targets within the
background of cellular RNA. Such a cleavage event renders the RNA non-functional and
abrogates protein expression from that RNA. In this manner, synthesis of a protein associated
with a disease state can be selectively inhibited (Warashina et al., 1999, Chemistry and
Biology, 6, 237-250).

Enzymatic nucleic acid molecules of the invention that are allosterically regulated
("allozymes") can be used to modulate, including down-regulate, Ras, HIV and/or HER2
expression. These allosteric enzymatic nucleic acids or allozymes (see for example George et
al., US Patent Nos. 5,834,186 and 5,741,679, Shih et al., US Patent No. 5,589,332, Nathan et
al., US Patent No. 5,871,914, Nathan and Ellington, International PCT publication No. WO
00/24931, Breaker et al., International PCT Publication Nos. WO 00/26226 and 98/27104,
and Sullenger et al., International PCT publication No. WO 99/29842) are designed to
respond to a signaling agent, for example, mutant Ras, HIV and/or HER2 protein, wild-type
Ras, HIV and/or HER2 protein, mutant Ras, HIV and/or HER2 RNA, wild-type Ras, HIV
and/or HER2 RNA, other proteins and/or RNAs involved in Ras, HIV and/or HER2 activity,
compounds, metals, polymers, molecules and/or drugs that are targeted to Ras, HIV and/or
HER2 expressing cells etc., which, in turn, modulate the activity of the enzymatic nucleic
acid molecule. In response to interaction with a predetermined signaling agent, the activity of
the allosteric enzymatic nucleic acid molecule is activated or inhibited such that the
expression of a particular target is selectively regulated, including down-regulated. The target
can comprise wild-type Ras, HIV and/or HER2, mutant Ras, HIV and/or HER2, a component
of Ras, HIV and/or HER2, and/or a predetermined cellular component that modulates Ras,
HIV and/or HER2 activity. For example, allosteric enzymatic nucleic acid molecules that are
activated by interaction with an RNA encoding Ras, HIV and/or HER2 protein can be used as
therapeutic agents in vivo. The presence of RNA encoding the Ras, HIV and/or HER2
protein activates the allosteric enzymatic nucleic acid molecule that subsequently cleaves the
RNA encoding Ras, HIV and/or HER2 protein, resulting in the inhibition of Ras, HIV and/or
HER2 protein expression. In this manner, cells that express the Ras, HIV and/or HER2
protein are selectively targeted.

In another non-limiting example, an allozyme can be activated by a Ras, HIV and/or 
HER2 protein, peptide, or mutant polypeptide that causes the allozyme to inhibit the 
expression of Ras, HIV and/or HER2 gene, by, for example, cleaving RNA encoded by Ras, 
HIV and/or HER2 gene. In this non-limiting example, the allozyme acts as a decoy to inhibit 
the function of Ras, HIV and/or HER2 and also inhibit the expression of Ras, HIV and/or
HER2 once activated by the Ras, HIV and/or HER2 protein. 

Target sites 

Targets for useful enzymatic nucleic acid molecules and antisense nucleic acids can be
determined as disclosed in Draper et al., WO 93/23569; Sullivan et al., WO 93/23057;
Thompson et al., WO 94/02595; Draper et al., WO 95/04818; and McSwiggen et al., US Patent
No. 5,525,468, all of which are hereby incorporated by reference herein in their totality. Other
examples include the following PCT applications, which concern inactivation of expression of
disease-related genes: WO 95/23225, WO 95/13380, WO 94/02595, incorporated by reference herein.
Rather than repeat the guidance provided in those documents here, below are provided
specific non-limiting examples of such methods. Enzymatic nucleic acid molecules to such
targets are designed as described in the above applications and synthesized to be tested in
vitro and in vivo, as also described. The sequences of human K-Ras, H-Ras, HIV-1 and HER2
RNAs were screened for optimal enzymatic nucleic acid target sites using a computer-folding
algorithm. Nucleic acid molecule binding/cleavage sites were identified. These sites are
shown in the Tables (all sequences are 5' to 3' in the tables). The nucleotide base position is
noted in the Tables as that site to be cleaved by the designated type of enzymatic nucleic acid
molecule. Human sequences can be screened and enzymatic nucleic acid molecule and/or
antisense thereafter designed, as discussed in Stinchcomb et al., WO 95/23225. In addition,
mouse targeted nucleic acid molecules can be used to test efficacy of action of the enzymatic
nucleic acid molecule, siRNA and/or antisense prior to testing in humans.

In addition, enzymatic nucleic acid, siRNA, and antisense nucleic acid molecule
binding/cleavage sites were identified. The nucleic acid molecules are individually analyzed
by computer folding (Jaeger et al., 1989 Proc. Natl. Acad. Sci. USA, 86, 7706) to assess
whether the sequences fold into the appropriate secondary structure. Those nucleic acid
molecules with unfavorable intramolecular interactions, such as between, for example, the
binding arms and the catalytic core of an enzymatic nucleic acid, are eliminated from
consideration. Varying binding arm lengths can be chosen to optimize activity.

Antisense, hammerhead, DNAzyme, NCH, amberzyme, zinzyme or G-Cleaver
enzymatic nucleic acid molecule, siRNA, and antisense nucleic acid binding/cleavage sites
were identified and were designed to anneal to various sites in the RNA target. The
enzymatic nucleic acid binding arms or siRNA and antisense nucleic acid sequences are
complementary to the target site sequences described above. The nucleic acid molecules are
chemically synthesized. The method of synthesis used follows the procedure for normal
DNA/RNA synthesis as described below and in Usman et al., 1987 J. Am. Chem. Soc., 109,
7845; Scaringe et al., 1990 Nucleic Acids Res., 18, 5433; and Wincott et al., 1995 Nucleic
Acids Res. 23, 2677-2684; Caruthers et al., 1992, Methods in Enzymology 211, 3-19.
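
The statement that binding arms and antisense sequences are complementary to the target site can be illustrated with a minimal sketch. It assumes plain Watson-Crick pairing and a target site written 5' to 3', as in the Tables; the function names and the example sequence are hypothetical and are not taken from the Tables.

```python
# Illustrative sketch only: names and the example sequence are hypothetical.
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(target_5to3: str) -> str:
    """Return the RNA strand, written 5'->3', that pairs with the target site."""
    target = target_5to3.upper().replace("T", "U")
    try:
        # Read the target 3'->5' and complement each base.
        return "".join(RNA_COMPLEMENT[base] for base in reversed(target))
    except KeyError as exc:
        raise ValueError(f"unexpected base: {exc.args[0]}") from None


def as_dna(seq: str) -> str:
    """Write the same sequence with T in place of U (e.g. for DNA arms)."""
    return seq.replace("U", "T")


if __name__ == "__main__":
    site = "AUGGCU"  # made-up target site, 5'->3'
    arm = reverse_complement_rna(site)
    print(arm, as_dna(arm))
```

This covers only sequence complementarity; the folding analysis and the choice of arm length described above are separate steps.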

Synthesis of Nucleic Acid Molecules

Synthesis of nucleic acids greater than 100 nucleotides in length can be difficult using 
automated methods, and the therapeutic cost of such molecules can be prohibitive. In this 
invention, small nucleic acid motifs ("small" refers to nucleic acid motifs less than about 100 
nucleotides in length, preferably less than about 80 nucleotides in length, and more preferably 
less than about 50 nucleotides in length; e.g., DNAzymes) are preferably used for exogenous
delivery. The simple structure of these molecules increases the ability of the nucleic acid to 
invade targeted regions of RNA structure. Exemplary molecules of the instant invention are 
chemically synthesized as described herein, and others can similarly be synthesized. 

Oligonucleotides (e.g., DNAzymes, antisense) are synthesized using protocols known
in the art as described in Caruthers et al., 1992, Methods in Enzymology 211, 3-19, Thompson
et al., International PCT Publication No. WO 99/54459, Wincott et al., 1995, Nucleic Acids
Res. 23, 2677-2684, Wincott et al., 1997, Methods Mol. Bio., 74, 59, Brennan et al., 1998,
Biotechnol. Bioeng., 61, 33-45, and Brennan, US Patent No. 6,001,311. All of these
references are incorporated herein by reference. The synthesis of oligonucleotides makes use
of common nucleic acid protecting and coupling groups, such as dimethoxytrityl at the 5'-end,
and phosphoramidites at the 3'-end. In a non-limiting example, small scale syntheses are
conducted on a 394 Applied Biosystems, Inc. synthesizer using a 0.2 µmol scale protocol
with a 2.5 min coupling step for 2'-O-methylated nucleotides and a 45 sec coupling step for
2'-deoxy nucleotides. Table I outlines the amounts and the contact times of the reagents
used in the synthesis cycle. Alternatively, syntheses at the 0.2 µmol scale can be performed
on a 96-well plate synthesizer, such as the instrument produced by Protogene (Palo Alto, CA)
with minimal modification to the cycle. A 33-fold excess (60 µL of 0.11 M = 6.6 µmol) of
2'-O-methyl phosphoramidite and a 105-fold excess of S-ethyl tetrazole (60 µL of 0.25 M =
15 µmol) can be used in each coupling cycle of 2'-O-methyl residues relative to polymer-
bound 5'-hydroxyl. A 22-fold excess (40 µL of 0.11 M = 4.4 µmol) of deoxy
phosphoramidite and a 70-fold excess of S-ethyl tetrazole (40 µL of 0.25 M = 10 µmol) can
be used in each coupling cycle of deoxy residues relative to polymer-bound 5'-hydroxyl.
Average coupling yields on the 394 Applied Biosystems, Inc. synthesizer, determined by
colorimetric quantitation of the trityl fractions, are typically 97.5-99%. Other oligonucleotide
synthesis reagents for the 394 Applied Biosystems, Inc. synthesizer include the following:
detritylation solution is 3% TCA in methylene chloride (ABI); capping is performed with 16%
N-methyl imidazole in THF (ABI) and 10% acetic anhydride/10% 2,6-lutidine in THF (ABI); and
oxidation solution is 16.9 mM I2, 49 mM pyridine, 9% water in THF (PERSEPTIVE™).
Burdick & Jackson Synthesis Grade acetonitrile is used directly from the reagent bottle.
S-Ethyltetrazole solution (0.25 M in acetonitrile) is made up from the solid obtained from
American International Chemical, Inc. Alternately, for the introduction of phosphorothioate
linkages, Beaucage reagent (3H-1,2-Benzodithiol-3-one 1,1-dioxide, 0.05 M in acetonitrile) is
used.

Deprotection of the DNAzymes is performed as follows: the polymer-bound trityl-on 
oligoribonucleotide is transferred to a 4 mL glass screw top vial and suspended in a solution 
of 40% aq. methylamine (1 mL) at 65 °C for 10 min. After cooling to -20 °C, the supernatant 
is removed from the polymer support. The support is washed three times with 1.0 mL of 
EtOH:MeCN:H2O/3:1:1, vortexed and the supernatant is then added to the first supernatant.
The combined supernatants, containing the oligoribonucleotide, are dried to a white powder. 

The method of synthesis used for RNA and chemically modified RNA or DNA,
including certain enzymatic nucleic acid molecules and siRNA molecules, follows the
procedure as described in Usman et al., 1987, J. Am. Chem. Soc., 109, 7845; Scaringe et al.,
1990, Nucleic Acids Res., 18, 5433; Wincott et al., 1995, Nucleic Acids Res. 23, 2677-2684;
and Wincott et al., 1997, Methods Mol. Bio., 74, 59, and makes use of common nucleic acid
protecting and coupling groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites
at the 3'-end. In a non-limiting example, small scale syntheses are conducted on a 394
Applied Biosystems, Inc. synthesizer using a 0.2 µmol scale protocol with a 7.5 min coupling
step for alkylsilyl protected nucleotides and a 2.5 min coupling step for 2'-O-methylated
nucleotides. Table I outlines the amounts and the contact times of the reagents used in the
synthesis cycle. Alternatively, syntheses at the 0.2 µmol scale can be done on a 96-well plate
synthesizer, such as the instrument produced by Protogene (Palo Alto, CA) with minimal
modification to the cycle. A 33-fold excess (60 µL of 0.11 M = 6.6 µmol) of 2'-O-methyl
phosphoramidite and a 75-fold excess of S-ethyl tetrazole (60 µL of 0.25 M = 15 µmol) can
be used in each coupling cycle of 2'-O-methyl residues relative to polymer-bound 5'-
hydroxyl. A 66-fold excess (120 µL of 0.11 M = 13.2 µmol) of alkylsilyl (ribo) protected
phosphoramidite and a 150-fold excess of S-ethyl tetrazole (120 µL of 0.25 M = 30 µmol)
can be used in each coupling cycle of ribo residues relative to polymer-bound 5'-hydroxyl.
Average coupling yields on the 394 Applied Biosystems, Inc. synthesizer, determined by
colorimetric quantitation of the trityl fractions, are typically 97.5-99%. Other oligonucleotide
synthesis reagents for the 394 Applied Biosystems, Inc. synthesizer include the following:
detritylation solution is 3% TCA in methylene chloride (ABI); capping is performed with 16%
N-methyl imidazole in THF (ABI) and 10% acetic anhydride/10% 2,6-lutidine in THF (ABI); and
oxidation solution is 16.9 mM I2, 49 mM pyridine, 9% water in THF (PERSEPTIVE™). Burdick &
Jackson Synthesis Grade acetonitrile is used directly from the reagent bottle. S-Ethyltetrazole
solution (0.25 M in acetonitrile) is made up from the solid obtained from American
International Chemical, Inc. Alternately, for the introduction of phosphorothioate linkages,
Beaucage reagent (3H-1,2-Benzodithiol-3-one 1,1-dioxide, 0.05 M in acetonitrile) is used.
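
The excess figures quoted in the preceding paragraph for the RNA protocol follow from volume times concentration, divided by the 0.2 µmol synthesis scale. The sketch below reproduces that arithmetic; the function and variable names are hypothetical, and the inputs are the volumes and concentrations stated in that paragraph.

```python
# Illustrative arithmetic only; function and variable names are hypothetical.
SCALE_UMOL = 0.2  # synthesis scale stated in the text (polymer-bound 5'-hydroxyl)


def delivered_umol(volume_ul: float, molarity: float) -> float:
    """Micromoles delivered: volume (uL) x concentration (mol/L) = umol."""
    return volume_ul * molarity


def fold_excess(volume_ul: float, molarity: float,
                scale_umol: float = SCALE_UMOL) -> float:
    """Molar excess of a reagent relative to the synthesis scale."""
    return delivered_umol(volume_ul, molarity) / scale_umol


# (volume in uL, concentration in M) as given for the RNA protocol
reagents = {
    "2'-O-methyl phosphoramidite": (60, 0.11),
    "S-ethyl tetrazole (2'-O-methyl cycle)": (60, 0.25),
    "alkylsilyl (ribo) phosphoramidite": (120, 0.11),
    "S-ethyl tetrazole (ribo cycle)": (120, 0.25),
}

if __name__ == "__main__":
    for name, (vol, conc) in reagents.items():
        print(f"{name}: {delivered_umol(vol, conc):.1f} umol, "
              f"{fold_excess(vol, conc):.0f}-fold excess")
```

Because only the ratio to the scale matters, the same function can be reused with a different `scale_umol` when the synthesis is scaled up or down.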

Deprotection of the RNA is performed using either a two-pot or one-pot protocol. For
the two-pot protocol, the polymer-bound trityl-on oligoribonucleotide is transferred to a 4 mL
glass screw top vial and suspended in a solution of 40% aq. methylamine (1 mL) at 65 °C for
10 min. After cooling to -20 °C, the supernatant is removed from the polymer support. The
support is washed three times with 1.0 mL of EtOH:MeCN:H2O/3:1:1, vortexed and the
supernatant is then added to the first supernatant. The combined supernatants, containing the
oligoribonucleotide, are dried to a white powder. The base deprotected oligoribonucleotide is
resuspended in anhydrous TEA/HF/NMP solution (300 µL of a solution of 1.5 mL
N-methylpyrrolidinone, 750 µL TEA and 1 mL TEA·3HF to provide a 1.4 M HF concentration)
and heated to 65 °C. After 1.5 h, the oligomer is quenched with 1.5 M NH4HCO3.

Alternatively, for the one-pot protocol, the polymer-bound trityl-on oligoribonucleotide
is transferred to a 4 mL glass screw top vial and suspended in a solution of 33% ethanolic
methylamine/DMSO: 1/1 (0.8 mL) at 65 °C for 15 min. The vial is brought to r.t. TEA·3HF
(0.1 mL) is added and the vial is heated at 65 °C for 15 min. The sample is cooled at -20 °C
and then quenched with 1.5 M NH4HCO3.

For purification of the trityl-on oligomers, the quenched NH4HCO3 solution is loaded
onto a C-18 containing cartridge that had been prewashed with acetonitrile followed by 50
mM TEAA. After washing the loaded cartridge with water, the RNA is detritylated with
0.5% TFA for 13 min. The cartridge is then washed again with water, salt exchanged with 1
M NaCl and washed with water again. The oligonucleotide is then eluted with 30%
acetonitrile.

Inactive nucleic acid molecules or binding attenuated control (BAC) oligonucleotides
can be synthesized by substituting one or more nucleotides in the nucleic acid molecule to
inactivate the molecule and such molecules can serve as a negative control.

The average stepwise coupling yields are typically >98% (Wincott et al., 1995 Nucleic
Acids Res. 23, 2677-2684). Those of ordinary skill in the art will recognize that the scale of
synthesis can be adapted to be larger or smaller than the example described above, including
but not limited to a 96-well format; all that is important is the ratio of chemicals used in the
reaction.
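
To give a sense of what a stepwise coupling yield means for a complete oligonucleotide, the following sketch estimates the fraction of full-length product. It rests on simplifying assumptions that are not stated in the text: every coupling proceeds with the same yield, the first nucleotide is already attached to the support (so an n-mer needs n - 1 couplings), and losses in deprotection and purification are ignored. The function name is hypothetical.

```python
# Illustrative estimate only; assumes identical, independent coupling steps.
def full_length_fraction(length_nt: int, stepwise_yield: float) -> float:
    """Estimated fraction of chains that reach full length.

    An n-mer built on a support that already carries the first nucleotide
    requires n - 1 coupling steps.
    """
    if length_nt < 1:
        raise ValueError("length_nt must be at least 1")
    if not 0.0 <= stepwise_yield <= 1.0:
        raise ValueError("stepwise_yield must be between 0 and 1")
    return stepwise_yield ** (length_nt - 1)


if __name__ == "__main__":
    for y in (0.975, 0.98, 0.99):
        frac = full_length_fraction(35, y)
        print(f"35-mer at {y:.1%} per step: {frac:.1%} full length")
```

Under these assumptions a small change in stepwise yield compounds over the length of the oligonucleotide, which is one reason shorter motifs are easier to prepare.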

Alternatively, the nucleic acid molecules of the present invention can be synthesized
separately and joined together post-synthetically, for example by ligation (Moore et al., 1992,
Science 256, 9923; Draper et al., International PCT publication No. WO 93/23569;
Shabarova et al., 1991, Nucleic Acids Research 19, 4247; Bellon et al., 1997, Nucleosides &
Nucleotides, 16, 951; Bellon et al., 1997, Bioconjugate Chem. 8, 204).

The nucleic acid molecules of the present invention can be modified extensively to
enhance stability by modification with nuclease resistant groups, for example, 2'-amino,
2'-C-allyl, 2'-fluoro, 2'-O-methyl, 2'-H (for a review see Usman and Cedergren, 1992, TIBS 17, 34;
Usman et al., 1994, Nucleic Acids Symp. Ser. 31, 163). Enzymatic nucleic acid molecules are
purified by gel electrophoresis using known methods or are purified by high pressure liquid
chromatography (HPLC; see Wincott et al., supra, the totality of which is hereby
incorporated herein by reference) and are re-suspended in water.

The sequences of the nucleic acid molecules, including enzymatic nucleic acid 
molecules and antisense, that are chemically synthesized, are shown in the Tables herein. 
These sequences are representative only of many more such sequences where the enzymatic 
portion of the enzymatic nucleic acid molecule (all but the binding arms) is modified to affect 
activity. For example, the enzymatic nucleic acid sequences listed in the Tables can be
formed of deoxyribonucleotides or other nucleotides or non-nucleotides. Such enzymatic 
nucleic acid molecules with enzymatic activity are equivalent to the enzymatic nucleic acid 
molecules described specifically in the Tables. 

Optimizing Activity of the Nucleic Acid Molecule of the Invention. 

Chemically synthesizing nucleic acid molecules with modifications (base, sugar and/or
phosphate) that prevent their degradation by serum ribonucleases can increase their potency
(see e.g., Eckstein et al., International Publication No. WO 92/07065; Perrault et al., 1990
Nature 344, 565; Pieken et al., 1991, Science 253, 314; Usman and Cedergren, 1992, Trends
in Biochem. Sci. 17, 334; Usman et al., International Publication No. WO 93/15187; and
Rossi et al., International Publication No. WO 91/03162; Sproat, US Patent No. 5,334,711;
and Burgin et al., supra, all of which are hereby incorporated by reference in their entirety).
All of the above references describe various chemical modifications that can be made to the
base, phosphate and/or sugar moieties of the nucleic acid molecules described herein.
Modifications which enhance their efficacy in cells, and removal of bases from nucleic acid
molecules to shorten oligonucleotide synthesis times and reduce chemical requirements are
desired.

There are several examples of sugar, base and phosphate modifications that can be
introduced into nucleic acid molecules with significant enhancement in their nuclease
stability and efficacy. For example, oligonucleotides can be modified to enhance stability
and/or enhance biological activity by modification with nuclease resistant groups, for
example, 2'-amino, 2'-C-allyl, 2'-fluoro, 2'-O-methyl, 2'-H, nucleotide base modifications (for
a review see Usman and Cedergren, 1992, TIBS 17, 34; Usman et al., 1994, Nucleic Acids
Symp. Ser. 31, 163; Burgin et al., 1996, Biochemistry, 35, 14090). Sugar modifications of
nucleic acid molecules are also known to increase efficacy (see Eckstein et al., International
Publication PCT No. WO 92/07065; Perrault et al., Nature, 1990, 344, 565-568; Pieken et
al., Science, 1991, 253, 314-317; Usman and Cedergren, Trends in Biochem. Sci., 1992, 17,
334-339; Usman et al., International Publication PCT No. WO 93/15187; Sproat, US Patent
No. 5,334,711 and Beigelman et al., 1995, J. Biol. Chem., 270, 25702; Beigelman et al.,
International PCT publication No. WO 97/26270; Beigelman et al., US Patent No. 5,716,824;
Usman et al., US Patent No. 5,627,053; Woolf et al., International PCT Publication No. WO
98/13526; Thompson et al., USSN 60/082,404 which was filed on April 20, 1998; Karpeisky
et al., 1998, Tetrahedron Lett., 39, 1131; Earnshaw and Gait, 1998, Biopolymers (Nucleic
Acid Sciences), 48, 39-55; Verma and Eckstein, 1998, Annu. Rev. Biochem., 67, 99-134; and
Burlina et al., 1997, Bioorg. Med. Chem., 5, 1999-2010; all of the references are hereby
incorporated in their totality by reference herein). The publications describe general methods
and strategies to determine the location of incorporation of sugar, base and/or phosphate
modifications and the like into enzymatic nucleic acid molecules without inhibiting catalysis.
Similar modifications can be used as described herein to modify the nucleic acid molecules of
the instant invention.

While chemical modification of oligonucleotide internucleotide linkages with
phosphorothioate, phosphorodithioate, and/or 5'-methylphosphonate linkages improves
stability, excessive modifications can cause some toxicity. Therefore, when designing nucleic
acid molecules, the amount of these internucleotide linkages should be minimized. The
reduction in the concentration of these linkages can lower toxicity, resulting in increased
efficacy and higher specificity of the therapeutic nucleic acid molecules.

Nucleic acid molecules having chemical modifications that maintain or enhance activity
are provided. Such nucleic acid molecules are also generally more resistant to nucleases than
unmodified nucleic acid molecules. Thus, the in vitro and/or in vivo activity should not be
significantly lowered. Therapeutic nucleic acid molecules delivered exogenously are
optimally stable within cells until translation of the target RNA has been inhibited long
enough to reduce the levels of the undesirable protein. This period of time varies between
hours to days, depending upon the disease state. Nucleic acid molecules are preferably
resistant to nucleases in order to function as effective intracellular therapeutic agents.
Improvements in the chemical synthesis of RNA and DNA (Wincott et al., 1995 Nucleic
Acids Res. 23, 2677; Caruthers et al., 1992, Methods in Enzymology 211, 3-19 (incorporated
by reference herein)) have expanded the ability to modify nucleic acid molecules by
introducing nucleotide modifications to enhance their nuclease stability as described above.

In one embodiment, nucleic acid molecules of the invention include one or more
G-clamp nucleotides. A G-clamp nucleotide is a modified cytosine analog wherein
modifications result in the ability to hydrogen bond both Watson-Crick and Hoogsteen faces
of a complementary guanine within a duplex, see for example Lin and Matteucci, 1998, J.
Am. Chem. Soc., 120, 8531-8532. A single G-clamp analog substitution within an
oligonucleotide can result in substantially enhanced helical thermal stability and mismatch
discrimination when hybridized to complementary oligonucleotides. The inclusion of such
nucleotides in nucleic acid molecules of the invention can enable both enhanced affinity and
specificity to nucleic acid targets.

In another embodiment, the invention features conjugates and/or complexes of nucleic
acid molecules targeting Ras genes such as K-Ras, H-Ras, and/or N-Ras. Compositions and
conjugates are used to facilitate delivery of molecules into a biological system, such as cells.
The conjugates provided by the instant invention can impart therapeutic activity by
transferring therapeutic compounds across cellular membranes, altering the pharmacokinetics,
and/or modulating the localization of nucleic acid molecules of the invention. The present
invention encompasses the design and synthesis of novel agents for the delivery of molecules,
including but not limited to, small molecules, lipids, phospholipids, nucleosides, nucleotides,
nucleic acids, antibodies, toxins, negatively charged polymers and other polymers, for
example proteins, peptides, hormones, carbohydrates, polyethylene glycols, or polyamines,
across cellular membranes. In general, the transporters described are designed to be used
either individually or as part of a multi-component system, with or without degradable
linkers. These compounds are expected to improve delivery and/or localization of nucleic
acid molecules of the invention into a number of cell types originating from different tissues,
in the presence or absence of serum (see Sullenger and Cech, US 5,854,038). Conjugates of
the molecules described herein can be attached to biologically active molecules via linkers
that are biodegradable, such as biodegradable nucleic acid linker molecules.

The term "biodegradable nucleic acid linker molecule" as used herein, refers to a
nucleic acid molecule that is designed as a biodegradable linker to connect one molecule to
another molecule, for example, a biologically active molecule. The stability of the
biodegradable nucleic acid linker molecule can be modulated by using various combinations
of ribonucleotides, deoxyribonucleotides, and chemically modified nucleotides, for example
2'-O-methyl, 2'-fluoro, 2'-amino, 2'-O-amino, 2'-C-allyl, 2'-O-allyl, and other 2'-modified
or base modified nucleotides. The biodegradable nucleic acid linker molecule can be a dimer,
trimer, tetramer or longer nucleic acid molecule, for example, an oligonucleotide of about 2,
3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, or 20 nucleotides in length, or can
comprise a single nucleotide with a phosphorus based linkage, for example, a
phosphoramidate or phosphodiester linkage. The biodegradable nucleic acid linker molecule
can also comprise nucleic acid backbone, nucleic acid sugar, or nucleic acid base
modifications.

The term "biodegradable" as used herein, refers to degradation in a biological system,
for example, enzymatic degradation or chemical degradation.

The term "biologically active molecule" as used herein, refers to compounds or
molecules that are capable of eliciting or modifying a biological response in a system.
Non-limiting examples of biologically active molecules contemplated by the instant invention
include therapeutically active molecules such as antibodies, hormones, antivirals, peptides,
proteins, chemotherapeutics, small molecules, vitamins, co-factors, nucleosides, nucleotides,
oligonucleotides, enzymatic nucleic acids, antisense nucleic acids, triplex forming
oligonucleotides, 2,5-A chimeras, siRNA, dsRNA, allozymes, aptamers, decoys and analogs
thereof. Biologically active molecules of the invention also include molecules capable of
modulating the pharmacokinetics and/or pharmacodynamics of other biologically active
molecules, for example lipids and polymers such as polyamines, polyamides, polyethylene
glycol and other polyethers.

The term "phospholipid" as used herein, refers to a hydrophobic molecule comprising
at least one phosphorus group. For example, a phospholipid can comprise a phosphorus
containing group and saturated or unsaturated alkyl group, optionally substituted with OH,
COOH, oxo, amine, or substituted or unsubstituted aryl groups.

Use of the nucleic acid-based molecules of the invention can lead to better treatment of 
the disease progression by affording the possibility of combination therapies (e.g., multiple 
antisense or enzymatic nucleic acid molecules targeted to different genes, nucleic acid
molecules coupled with known small molecule inhibitors, or intermittent treatment with 
combinations of molecules (including different motifs) and/or other chemical or biological 
molecules). The treatment of subjects with nucleic acid molecules can also include 
combinations of different types of nucleic acid molecules. 

In the case that down-regulation of the target is desired, therapeutic nucleic acid
molecules (e.g., DNAzymes) delivered exogenously are optimally stable within cells until
translation of the target RNA has been inhibited long enough to reduce the levels of the
targeted protein. This period of time varies between hours to days depending upon the
disease state. These nucleic acid molecules should be resistant to nucleases in order to
function as effective intracellular therapeutic agents. Improvements in the chemical synthesis
of nucleic acid molecules described in the instant invention and others known in the art have
expanded the ability to modify nucleic acid molecules by introducing nucleotide
modifications to enhance their nuclease stability as described above.

In another embodiment, nucleic acid catalysts having chemical modifications that
maintain or enhance enzymatic activity are provided. Such nucleic acids are also generally
more resistant to nucleases than unmodified nucleic acid. Thus, the in vitro and/or in vivo
activity of the nucleic acid should not be significantly lowered. As exemplified herein,
such enzymatic nucleic acids are useful for in vitro and/or in vivo techniques even if activity
overall is reduced 10-fold (Burgin et al., 1996, Biochemistry, 35, 14090). Such enzymatic
nucleic acids herein are said to "maintain" the enzymatic activity of an all RNA ribozyme or
all DNA DNAzyme.

In another aspect the nucleic acid molecules comprise a 5'- and/or a 3'-cap structure.

By "cap structure" is meant chemical modifications, which have been incorporated at
either terminus of the oligonucleotide (see, for example, Wincott et al., WO 97/26270,
incorporated by reference herein). These terminal modifications protect the nucleic acid
molecule from exonuclease degradation, and can help in delivery and/or localization within a
cell. The cap can be present at the 5'-terminus (5'-cap) or at the 3'-terminus (3'-cap) or can
be present on both termini. In non-limiting examples, the 5'-cap includes inverted abasic
residue (moiety), 4',5'-methylene nucleotide; 1-(beta-D-erythrofuranosyl) nucleotide, 4'-thio
nucleotide, carbocyclic nucleotide; 1,5-anhydrohexitol nucleotide; L-nucleotides; alpha-
nucleotides; modified base nucleotide; phosphorodithioate linkage; threo-pentofuranosyl
nucleotide; acyclic 3',4'-seco nucleotide; acyclic 3,4-dihydroxybutyl nucleotide; acyclic 3,5-
dihydroxypentyl nucleotide, 3'-3'-inverted nucleotide moiety; 3'-3'-inverted abasic moiety;
3'-2'-inverted nucleotide moiety; 3'-2'-inverted abasic moiety; 1,4-butanediol phosphate;
3'-phosphoramidate; hexylphosphate; aminohexyl phosphate; 3'-phosphate; 3'-phosphorothioate;
phosphorodithioate; or bridging or non-bridging methylphosphonate moiety (for more details
see Wincott et al., International PCT publication No. WO 97/26270, incorporated by
reference herein).

In another embodiment, the 3'-cap includes, for example, 4',5'-methylene nucleotide;
1-(beta-D-erythrofuranosyl) nucleotide; 4'-thio nucleotide; carbocyclic nucleotide; 5'-amino-
alkyl phosphate; 1,3-diamino-2-propyl phosphate; 3-aminopropyl phosphate; 6-aminohexyl
phosphate; 1,2-aminododecyl phosphate; hydroxypropyl phosphate; 1,5-anhydrohexitol
nucleotide; L-nucleotide; alpha-nucleotide; modified base nucleotide; phosphorodithioate;
threo-pentofuranosyl nucleotide; acyclic 3',4'-seco nucleotide; 3,4-dihydroxybutyl nucleotide;
3,5-dihydroxypentyl nucleotide; 5'-5'-inverted nucleotide moiety; 5'-5'-inverted abasic moiety;
5'-phosphoramidate; 5'-phosphorothioate; 1,4-butanediol phosphate; 5'-amino; bridging
and/or non-bridging 5'-phosphoramidate, phosphorothioate and/or phosphorodithioate,
bridging or non-bridging methylphosphonate and 5'-mercapto moieties (for more details see
Beaucage and Iyer, 1993, Tetrahedron 49, 1925; incorporated by reference herein).

By the term "non-nucleotide" is meant any group or compound which can be 
incorporated into a nucleic acid chain in the place of one or more nucleotide units, including 
either sugar and/or phosphate substitutions, and allows the remaining bases to exhibit their 
enzymatic activity. The group or compound is abasic in that it does not contain a commonly 
recognized nucleotide base, such as adenosine, guanine, cytosine, uracil or thymine. 

The term "alkyl" as used herein refers to a saturated aliphatic hydrocarbon, including 
straight-chain, branched-chain "isoalkyl", and cyclic alkyl groups. The term "alkyl" also 
comprises alkoxy, alkyl-thio, alkyl-thio-alkyl, alkoxyalkyl, alkylamino, alkenyl, alkynyl, 
alkoxy, cycloalkenyl, cycloalkyl, cycloalkylalkyl, heterocycloalkyl, heteroaryl, C1-C6 
hydrocarbyl, aryl or substituted aryl groups. Preferably, the alkyl group has 1 to 12 carbons. 
More preferably it is a lower alkyl of from about 1 to 7 carbons, more preferably about 1 to 4 
carbons. The alkyl group can be substituted or unsubstituted. When substituted the 
substituted group(s) preferably comprise hydroxy, oxy, thio, amino, nitro, cyano, alkoxy, 
alkyl-thio, alkyl-thio-alkyl, alkoxyalkyl, alkylamino, silyl, alkenyl, alkynyl, alkoxy,
cycloalkenyl, cycloalkyl, cycloalkylalkyl, heterocycloalkyl, heteroaryl, C1-C6 hydrocarbyl, 
aryl or substituted aryl groups. The term "alkyl" also includes alkenyl groups containing at 
least one carbon-carbon double bond, including straight-chain, branched-chain, and cyclic 
groups. Preferably, the alkenyl group has about 2 to 12 carbons. More preferably it is a lower 
alkenyl of from about 2 to 7 carbons, more preferably about 2 to 4 carbons. The alkenyl 
group can be substituted or unsubstituted. When substituted the substituted group(s) 
preferably comprise hydroxy, oxy, thio, amino, nitro, cyano, alkoxy, alkyl-thio, alkyl-thio- 
alkyl, alkoxyalkyl, alkylamino, silyl, alkenyl, alkynyl, alkoxy, cycloalkenyl, cycloalkyl, 
cycloalkylalkyl, heterocycloalkyl, heteroaryl, C1-C6 hydrocarbyl, aryl or substituted aryl 
groups. 

The term "alkyl" also includes alkynyl groups containing at least one carbon-carbon 
triple bond, including straight-chain, branched-chain, and cyclic groups. Preferably, the 
alkynyl group has about 2 to 12 carbons. More preferably it is a lower alkynyl of from about 
2 to 7 carbons, more preferably about 2 to 4 carbons. The alkynyl group can be substituted or 
unsubstituted. When substituted the substituted group(s) preferably comprise hydroxy, oxy,
thio, amino, nitro, cyano, alkoxy, alkyl-thio, alkyl-thio-alkyl, alkoxyalkyl, alkylamino, silyl, 
alkenyl, alkynyl, alkoxy, cycloalkenyl, cycloalkyl, cycloalkylalkyl, heterocycloalkyl, 
heteroaryl, C1-C6 hydrocarbyl, aryl or substituted aryl groups. Alkyl groups or moieties of 
the invention can also include aryl, alkylaryl, carbocyclic aryl, heterocyclic aryl, amide and 
ester groups. The preferred substituent(s) of aryl groups are halogen, trihalomethyl, hydroxyl, 
SH, OH, cyano, alkoxy, alkyl, alkenyl, alkynyl, and amino groups. An "alkylaryl" group 
refers to an alkyl group (as described above) covalently joined to an aryl group (as described 
above). Carbocyclic aryl groups are groups wherein the ring atoms on the aromatic ring are 
all carbon atoms. The carbon atoms are optionally substituted. Heterocyclic aryl groups are 
groups having from about 1 to 3 heteroatoms as ring atoms in the aromatic ring and the 
remainder of the ring atoms are carbon atoms. Suitable heteroatoms include oxygen, sulfur,
and nitrogen, and suitable heterocyclic aryl groups include furanyl, thienyl, pyridyl,
pyrrolyl, N-lower alkyl pyrrolo, pyrimidyl, pyrazinyl, imidazolyl and the like, all optionally
substituted. An "amide" refers to a -C(O)-NH-R, where R is either alkyl, aryl, alkylaryl or
hydrogen. An "ester" refers to a -C(O)-OR', where R' is either alkyl, aryl, alkylaryl or hydrogen.

The term "alkoxyalkyl" as used herein refers to an alkyl-O-alkyl ether, for example, 
methoxyethyl or ethoxymethyl.

The term "alkyl-thio-alkyl" as used herein refers to an alkyl-S-alkyl thioether, for
example, methylthiomethyl or methylthioethyl.

The term "amino" as used herein refers to a nitrogen containing group as is known in 
the art derived from ammonia by the replacement of one or more hydrogen radicals by 
organic radicals. For example, the terms "aminoacyl" and "aminoalkyl" refer to specific N- 
substituted organic radicals with acyl and alkyl substituent groups respectively. 

The term "amination" as used herein refers to a process in which an amino group or 
substituted amine is introduced into an organic molecule. 

The term "exocyclic amine protecting moiety" as used herein refers to a nucleobase 
amino protecting group compatible with oligonucleotide synthesis, for example, an acyl or 
amide group. 

The term "alkenyl" as used herein refers to a straight or branched hydrocarbon of a 
designated number of carbon atoms containing at least one carbon-carbon double bond.
Examples of "alkenyl" include vinyl, allyl, and 2-methyl-3-heptene. 

The term "alkoxy" as used herein refers to an alkyl group of indicated number of 
carbon atoms attached to the parent molecular moiety through an oxygen bridge. Examples 
of alkoxy groups include, for example, methoxy, ethoxy, propoxy and isopropoxy. 

The term "alkynyl" as used herein refers to a straight or branched hydrocarbon of a
designated number of carbon atoms containing at least one carbon-carbon triple bond.
Examples of "alkynyl" include propargyl, propyne, and 3-hexyne. 

The term "aryl" as used herein refers to an aromatic hydrocarbon ring system 
containing at least one aromatic ring. The aromatic ring can optionally be fused or otherwise 
attached to other aromatic hydrocarbon rings or non-aromatic hydrocarbon rings. Examples 
of aryl groups include, for example, phenyl, naphthyl, 1,2,3,4-tetrahydronaphthalene and 
biphenyl. Preferred examples of aryl groups include phenyl and naphthyl. 

The term "cycloalkenyl" as used herein refers to a C3-C8 cyclic hydrocarbon 
containing at least one carbon-carbon double bond. Examples of cycloalkenyl include 
cyclopropenyl, cyclobutenyl, cyclopentenyl, cyclopentadiene, cyclohexenyl, 1,3-
cyclohexadiene, cycloheptenyl, cycloheptatrienyl, and cyclooctenyl.

The term "cycloalkyl" as used herein refers to a C3-C8 cyclic hydrocarbon. Examples
of cycloalkyl include cyclopropyl, cyclobutyl, cyclopentyl, cyclohexyl, cycloheptyl and
cyclooctyl.

The term "cycloalkylalkyl," as used herein, refers to a C3-C7 cycloalkyl group attached
to the parent molecular moiety through an alkyl group, as defined above. Examples of 
cycloalkylalkyl groups include cyclopropylmethyl and cyclopentylethyl. 

The terms "halogen" or "halo" as used herein refer to fluorine, chlorine,
bromine, and iodine. 

The term "heterocycloalkyl," as used herein refers to a non-aromatic ring system 
containing at least one heteroatom selected from nitrogen, oxygen, and sulfur. The 
heterocycloalkyl ring can be optionally fused to or otherwise attached to other 
heterocycloalkyl rings and/or non-aromatic hydrocarbon rings. Preferred heterocycloalkyl 
groups have from 3 to 7 members. Examples of heterocycloalkyl groups include, for 
example, piperazine, morpholine, piperidine, tetrahydrofuran, pyrrolidine, and pyrazole. 
Preferred heterocycloalkyl groups include piperidinyl, piperazinyl, morpholinyl, and 
pyrrolidinyl.

The term "heteroaryl" as used herein refers to an aromatic ring system containing at 
least one heteroatom selected from nitrogen, oxygen, and sulfur. The heteroaryl ring can be 
fused or otherwise attached to one or more heteroaryl rings, aromatic or non-aromatic 
hydrocarbon rings or heterocycloalkyl rings. Examples of heteroaryl groups include, for 
example, pyridine, furan, thiophene, 5,6,7,8-tetrahydroisoquinoline and pyrimidine. Preferred 
examples of heteroaryl groups include thienyl, benzothienyl, pyridyl, quinolyl, pyrazinyl, 
pyrimidyl, imidazolyl, benzimidazolyl, furanyl, benzofuranyl, thiazolyl, benzothiazolyl, 
isoxazolyl, oxadiazolyl, isothiazolyl, benzisothiazolyl, triazolyl, tetrazolyl, pyrrolyl, indolyl, 
pyrazolyl, and benzopyrazolyl. 

The term "C1-C6 hydrocarbyl" as used herein refers to straight, branched, or cyclic 
alkyl groups having 1-6 carbon atoms, optionally containing one or more carbon-carbon 
double or triple bonds. Examples of hydrocarbyl groups include, for example, methyl, ethyl, 
propyl, isopropyl, n-butyl, sec-butyl, tert-butyl, pentyl, 2-pentyl, isopentyl, neopentyl, hexyl, 
2-hexyl, 3-hexyl, 3-methylpentyl, vinyl, 2-pentene, cyclopropylmethyl, cyclopropyl, 
cyclohexylmethyl, cyclohexyl and propargyl. When reference is made herein to C1-C6 
hydrocarbyl containing one or two double or triple bonds it is understood that at least two 
carbons are present in the alkyl for one double or triple bond, and at least four carbons for two
double or triple bonds. 
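
The carbon-count rule stated above for C1-C6 hydrocarbyl groups can be illustrated with a minimal sketch. The function name and the lookup table are hypothetical names introduced only for illustration; the sketch encodes just the two constraints given in the text (at least two carbons for one double or triple bond, at least four for two) and is not a structure validator.

```python
# Minimum number of carbons required for a given number of double or
# triple bonds, as stated in the definition of "C1-C6 hydrocarbyl".
# Zero multiple bonds needs only one carbon (e.g., methyl).
MIN_CARBONS_BY_MULTIPLE_BONDS = {0: 1, 1: 2, 2: 4}


def is_consistent_c1_c6_hydrocarbyl(n_carbons: int, n_multiple_bonds: int) -> bool:
    """Return True if the carbon count is compatible with the stated rule.

    n_carbons: total number of carbon atoms (must be 1 to 6).
    n_multiple_bonds: number of double or triple bonds (0, 1, or 2).
    """
    if n_multiple_bonds not in MIN_CARBONS_BY_MULTIPLE_BONDS:
        return False  # the definition only discusses up to two multiple bonds
    if not 1 <= n_carbons <= 6:
        return False  # outside the C1-C6 range
    return n_carbons >= MIN_CARBONS_BY_MULTIPLE_BONDS[n_multiple_bonds]
```

For example, vinyl (two carbons, one double bond) satisfies the rule, whereas a three-carbon group with two multiple bonds does not.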

By "nucleotide" is meant a heterocyclic nitrogenous base in N-glycosidic linkage with
a phosphorylated sugar. Nucleotides are recognized in the art to include natural bases
(standard), and modified bases well known in the art. Such bases are generally located at the
1' position of a nucleotide sugar moiety. Nucleotides generally comprise a base, sugar and a
phosphate group. The nucleotides can be unmodified or modified at the sugar, phosphate
and/or base moiety (also referred to interchangeably as nucleotide analogs, modified
nucleotides, non-natural nucleotides, non-standard nucleotides and other; see for example,
Usman and McSwiggen, supra; Eckstein et al., International PCT Publication No. WO
92/07065; Usman et al., International PCT Publication No. WO 93/15187; Uhlman &
Peyman, supra; all are hereby incorporated by reference herein). There are several examples of
modified nucleic acid bases known in the art as summarized by Limbach et al., 1994, Nucleic
Acids Res. 22, 2183. Some of the non-limiting examples of chemically modified and other
natural nucleic acid bases that can be introduced into nucleic acids include, for example,
inosine, purine, pyridin-4-one, pyridin-2-one, phenyl, pseudouracil, 2, 4, 6-trimethoxy
benzene, 3-methyl uracil, dihydrouridine, naphthyl, aminophenyl, 5-alkylcytidines (e.g.,
5-methylcytidine), 5-alkyluridines (e.g., ribothymidine), 5-halouridine (e.g., 5-bromouridine)
or 6-azapyrimidines or 6-alkylpyrimidines (e.g., 6-methyluridine), propyne, quesosine,
2-thiouridine, 4-thiouridine, wybutosine, wybutoxosine, 4-acetylcytidine,
5-(carboxyhydroxymethyl)uridine, 5'-carboxymethylaminomethyl-2-thiouridine,
5-carboxymethylaminomethyluridine, beta-D-galactosylqueosine, 1-methyladenosine,
1-methylinosine, 2,2-dimethylguanosine, 3-methylcytidine, 2-methyladenosine,
2-methylguanosine, N6-methyladenosine, 7-methylguanosine, 5-methoxyaminomethyl-2-
thiouridine, 5-methylaminomethyluridine, 5-methylcarbonylmethyluridine,
5-methyloxyuridine, 5-methyl-2-thiouridine, 2-methylthio-N6-isopentenyladenosine, beta-D-
mannosylqueosine, uridine-5-oxyacetic acid, 2-thiocytidine, threonine derivatives and others
(Burgin et al., 1996, Biochemistry, 35, 14090; Uhlman & Peyman, supra). By "modified
bases" in this aspect is meant nucleotide bases other than adenine, guanine, cytosine and
uracil at 1' position or their equivalents; such bases can be used at any position, for example,
within the catalytic core of an enzymatic nucleic acid molecule and/or in the substrate-binding
regions of the nucleic acid molecule.

By "nucleoside" is meant a heterocyclic nitrogenous base in N-glycosidic linkage with 
a sugar. Nucleosides are recognized in the art to include natural bases (standard), and 
modified bases well known in the art. Such bases are generally located at the 1' position of a
nucleoside sugar moiety. Nucleosides generally comprise a base and sugar group. The
nucleosides can be unmodified or modified at the sugar and/or base moiety (also referred to
interchangeably as nucleoside analogs, modified nucleosides, non-natural nucleosides, non-
standard nucleosides and other; see for example, Usman and McSwiggen, supra; Eckstein et
al., International PCT Publication No. WO 92/07065; Usman et al., International PCT
Publication No. WO 93/15187; Uhlman & Peyman, supra; all are hereby incorporated by
reference herein). There are several examples of modified nucleic acid bases known in the art
as summarized by Limbach et al., 1994, Nucleic Acids Res. 22, 2183. Some of the non-
limiting examples of chemically modified and other natural nucleic acid bases that can be
introduced into nucleic acids include inosine, purine, pyridin-4-one, pyridin-2-one, phenyl,
pseudouracil, 2, 4, 6-trimethoxy benzene, 3-methyl uracil, dihydrouridine, naphthyl,
aminophenyl, 5-alkylcytidines (e.g., 5-methylcytidine), 5-alkyluridines (e.g., ribothymidine),
5-halouridine (e.g., 5-bromouridine) or 6-azapyrimidines or 6-alkylpyrimidines (e.g.,
6-methyluridine), propyne, quesosine, 2-thiouridine, 4-thiouridine, wybutosine, wybutoxosine,
4-acetylcytidine, 5-(carboxyhydroxymethyl)uridine, 5'-carboxymethylaminomethyl-2-
thiouridine, 5-carboxymethylaminomethyluridine, beta-D-galactosylqueosine,
1-methyladenosine, 1-methylinosine, 2,2-dimethylguanosine, 3-methylcytidine,
2-methyladenosine, 2-methylguanosine, N6-methyladenosine, 7-methylguanosine,
5-methoxyaminomethyl-2-thiouridine, 5-methylaminomethyluridine,
5-methylcarbonylmethyluridine, 5-methyloxyuridine, 5-methyl-2-thiouridine, 2-methylthio-N6-
isopentenyladenosine, beta-D-mannosylqueosine, uridine-5-oxyacetic acid, 2-thiocytidine,
threonine derivatives and others (Burgin et al., 1996, Biochemistry, 35, 14090; Uhlman &
Peyman, supra). By "modified bases" in this aspect is meant nucleoside bases other than
adenine, guanine, cytosine and uracil at 1' position or their equivalents; such bases can be
used at any position, for example, within the catalytic core of an enzymatic nucleic acid
molecule and/or in the substrate-binding regions of the nucleic acid molecule.

In one embodiment, the invention features modified enzymatic nucleic acid molecules
with phosphate backbone modifications comprising one or more phosphorothioate,
phosphorodithioate, methylphosphonate, morpholino, amidate, carbamate, carboxymethyl,
acetamidate, polyamide, sulfonate, sulfonamide, sulfamate, formacetal, thioformacetal,
and/or alkylsilyl substitutions. For a review of oligonucleotide backbone modifications see
Hunziker and Leumann, 1995, Nucleic Acid Analogues: Synthesis and Properties, in Modern
Synthetic Methods, VCH, 331-417, and Mesmaeker et al., 1994, Novel Backbone
Replacements for Oligonucleotides, in Carbohydrate Modifications in Antisense Research,
ACS, 24-39. These references are hereby incorporated by reference herein.




By "abasic" is meant sugar moieties lacking a base or having other chemical groups in
place of a base at the 1' position, for example a 3',3'-linked or 5',5'-linked deoxyabasic
ribose derivative (for more details see Wincott et al., International PCT publication No. WO
97/26270).

By "unmodified nucleoside" is meant one of the bases adenine, cytosine, guanine,
thymine, uracil joined to the 1' carbon of β-D-ribo-furanose.

By "modified nucleoside" is meant any nucleotide base which contains a modification 
in the chemical structure of an unmodified nucleotide base, sugar and/or phosphate. 

In connection with 2'-modified nucleotides as described for the present invention, by
"amino" is meant 2'-NH2 or 2'-O-NH2, which can be modified or unmodified. Such
modified groups are described, for example, in Eckstein et al., U.S. Patent 5,672,695 and
Matulic-Adamic et al., WO 98/28317, respectively, which are both incorporated by reference
in their entireties.

Various modifications to nucleic acid (e.g., DNAzyme) structure can be made to 
enhance the utility of these molecules. For example, such modifications can enhance shelf-
life, half-life in vitro, stability, and ease of introduction of such oligonucleotides to the target
site, including e.g., enhancing penetration of cellular membranes and conferring the ability to 
recognize and bind to targeted cells. 

Use of these molecules can lead to better treatment of the disease progression by 
affording the possibility of combination therapies (e.g., multiple enzymatic nucleic acid 
molecules targeted to different genes, enzymatic nucleic acid molecules coupled with known 
small molecule inhibitors, or intermittent treatment with combinations of enzymatic nucleic 
acid molecules (including different enzymatic nucleic acid molecule motifs) and/or other 
chemical or biological molecules). The treatment of subjects with nucleic acid molecules can 
also include combinations of different types of nucleic acid molecules. Therapies can be 
devised which include a mixture of enzymatic nucleic acid molecules (including different
enzymatic nucleic acid molecule motifs), antisense and/or 2-5A chimera molecules to one or
more targets to alleviate symptoms of a disease. 

Administration of Nucleic Acid Molecules 

Methods for the delivery of nucleic acid molecules are described in Akhtar et al., 1992,
Trends Cell Bio., 2, 139; and Delivery Strategies for Antisense Oligonucleotide Therapeutics,
ed. Akhtar, 1995, which are both incorporated herein by reference. Sullivan et al., PCT WO
94/02595, further describes the general methods for delivery of enzymatic RNA molecules.
These protocols can be utilized for the delivery of virtually any nucleic acid molecule.
Nucleic acid molecules can be administered to cells by a variety of methods known to those
familiar to the art, including, but not restricted to, encapsulation in liposomes, by
iontophoresis, or by incorporation into other vehicles, such as hydrogels, cyclodextrins,
biodegradable nanocapsules, and bioadhesive microspheres. Alternatively, the nucleic
acid/vehicle combination is locally delivered by direct injection or by use of an infusion
pump. Other routes of delivery include, but are not limited to, oral (tablet or pill form) and/or
intrathecal delivery (Gold, 1997, Neuroscience, 76, 1153-1158). Other approaches include the
use of various transport and carrier systems, for example through the use of conjugates and
biodegradable polymers. For a comprehensive review on drug delivery strategies including
CNS delivery, see Ho et al., 1999, Curr. Opin. Mol. Ther., 1, 336-343 and Jain, Drug
Delivery Systems: Technologies and Commercial Opportunities, Decision Resources, 1998
and Groothuis et al., 1997, J. NeuroVirol., 3, 387-400. More detailed descriptions of nucleic
acid delivery and administration are provided in Sullivan et al., supra, Draper et al., PCT
WO 93/23569, Beigelman et al., PCT WO 99/05094, and Klimuk et al., PCT WO 99/04819, all
of which have been incorporated by reference herein.

The molecules of the instant invention can be used as pharmaceutical agents. 
Pharmaceutical agents prevent, inhibit the occurrence, or treat (alleviate a symptom to some 
extent, preferably all of the symptoms) of a disease state in a subject. 

The negatively charged polynucleotides of the invention can be administered (e.g., 
RNA, DNA or protein) and introduced into a subject by any standard means described herein 
and known in the art, with or without stabilizers, buffers, and the like, to form a 
pharmaceutical composition. When it is desired to use a liposome delivery mechanism, 
standard protocols for formation of liposomes can be followed. The compositions of the 
present invention can also be formulated and used as tablets, capsules or elixirs for oral 
administration; suppositories for rectal administration; sterile solutions; suspensions for
injectable administration; and the other compositions known in the art. 

The present invention also includes pharmaceutically acceptable formulations of the 
compounds described. These formulations include salts of the above compounds, e.g., acid
addition salts, for example, salts of hydrochloric, hydrobromic, acetic acid, and benzene 
sulfonic acid. 

A pharmacological composition or formulation refers to a composition or formulation 
in a form suitable for administration, e.g., systemic administration, into a cell or subject, 
preferably a human. Suitable forms, in part, depend upon the use or the route of entry, for
example oral, transdermal, or by injection. Such forms should not prevent the composition or
formulation from reaching a target cell (i.e., a cell to which the negatively charged polymer is
desired to be delivered). For example, pharmacological compositions injected into the
blood stream should be soluble. Other factors are known in the art, and include
considerations such as toxicity and forms which prevent the composition or formulation from 
exerting its effect. 

By "systemic administration" is meant in vivo systemic absorption or accumulation of 
drugs in the blood stream followed by distribution throughout the entire body. 

Administration routes which lead to systemic absorption include, without limitation:
intravenous, subcutaneous, intraperitoneal, inhalation, oral, intrapulmonary and
intramuscular. Each of these administration routes exposes the desired negatively charged
polymers, e.g., nucleic acids, to an accessible diseased tissue. The rate of entry of a drug into
the circulation has been shown to be a function of molecular weight or size. The use of a
liposome or other drug carrier comprising the compounds of the instant invention can
potentially localize the drug, for example, in certain tissue types, such as the tissues of the
reticular endothelial system (RES). A liposome formulation that can facilitate the association
of drug with the surface of cells, such as lymphocytes and macrophages, is also useful. This
approach can provide enhanced delivery of the drug to target cells by taking advantage of the
specificity of macrophage and lymphocyte immune recognition of abnormal cells, such as
cancer cells.

By "pharmaceutically acceptable formulation" is meant a composition or formulation that
allows for the effective distribution of the nucleic acid molecules of the instant invention in
the physical location most suitable for their desired activity. Non-limiting examples of agents
suitable for formulation with the nucleic acid molecules of the instant invention include:
PEG conjugated nucleic acids, phospholipid conjugated nucleic acids, nucleic acids
containing lipophilic moieties, phosphorothioates, P-glycoprotein inhibitors (such as Pluronic
P85) which can enhance entry of drugs into various tissues, for example the CNS (Jolliet-Riant
and Tillement, 1999, Fundam. Clin. Pharmacol., 13, 16-26); biodegradable polymers, such as
poly (DL-lactide-coglycolide) microspheres for sustained release delivery after implantation
(Emerich, DF et al., 1999, Cell Transplant, 8, 47-58) Alkermes, Inc. Cambridge, MA; and
loaded nanoparticles, such as those made of polybutylcyanoacrylate, which can deliver drugs
across the blood brain barrier and can alter neuronal uptake mechanisms (Prog
Neuropsychopharmacol Biol Psychiatry, 23, 941-949, 1999). Other non-limiting examples of
delivery strategies, including CNS delivery of the nucleic acid molecules of the instant
invention, include material described in Boado et al., 1998, J. Pharm. Sci., 87, 1308-1315;
Tyler et al., 1999, FEBS Lett., 421, 280-284; Pardridge et al., 1995, PNAS USA., 92, 5592-
5596; Boado, 1995, Adv. Drug Delivery Rev., 15, 73-107; Aldrian-Herrada et al., 1998,
Nucleic Acids Res., 26, 4910-4916; and Tyler et al., 1999, PNAS USA., 96, 7053-7058. All
these references are hereby incorporated herein by reference.

The invention also features the use of the composition comprising surface-modified
liposomes containing poly (ethylene glycol) lipids (PEG-modified, or long-circulating
liposomes or stealth liposomes). Nucleic acid molecules of the invention can also comprise
covalently attached PEG molecules of various molecular weights. These formulations offer a
method for increasing the accumulation of drugs in target tissues. This class of drug carriers
resists opsonization and elimination by the mononuclear phagocytic system (MPS or RES),
thereby enabling longer blood circulation times and enhanced tissue exposure for the
encapsulated drug (Lasic et al., Chem. Rev. 1995, 95, 2601-2627; Ishiwata et al., Chem.
Pharm. Bull. 1995, 43, 1005-1011). Such liposomes have been shown to accumulate
selectively in tumors, presumably by extravasation and capture in the neovascularized target
tissues (Lasic et al., Science 1995, 267, 1275-1276; Oku et al., 1995, Biochim. Biophys. Acta,
1238, 86-90). The long-circulating liposomes enhance the pharmacokinetics and
pharmacodynamics of DNA and RNA, particularly compared to conventional cationic
liposomes, which are known to accumulate in tissues of the MPS (Liu et al., J. Biol. Chem.
1995, 42, 24864-24870; Choi et al., International PCT Publication No. WO 96/10391; Ansell
et al., International PCT Publication No. WO 96/10390; Holland et al., International PCT
Publication No. WO 96/10392; all of which are incorporated by reference herein). Long-
circulating liposomes are also likely to protect drugs from nuclease degradation to a greater
extent compared to cationic liposomes, based on their ability to avoid accumulation in
metabolically aggressive MPS tissues such as the liver and spleen. All of these references are
incorporated by reference herein.

The present invention also includes compositions prepared for storage or administration 
that include a pharmaceutically effective amount of the desired compounds in a
pharmaceutically acceptable carrier or diluent. Acceptable carriers or diluents for therapeutic
use are well known in the pharmaceutical art, and are described, for example, in Remington's
Pharmaceutical Sciences, Mack Publishing Co. (A.R. Gennaro edit. 1985), hereby 
incorporated by reference herein. For example, preservatives, stabilizers, dyes and flavoring 
agents can be provided. These include sodium benzoate, sorbic acid and esters of p- 
hydroxybenzoic acid. In addition, antioxidants and suspending agents can be used.

A pharmaceutically effective dose is that dose required to prevent, inhibit the
occurrence, or treat (alleviate a symptom to some extent, preferably all of the symptoms) of a
disease state. The pharmaceutically effective dose depends on the type of disease, the
composition used, the route of administration, the type of mammal being treated, the physical
characteristics of the specific mammal under consideration, concurrent medication, and other
factors which those skilled in the medical arts will recognize. Generally, an amount between
0.1 mg/kg and 100 mg/kg body weight/day of active ingredients is administered dependent
upon potency of the negatively charged polymer.
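
The arithmetic implied by the general range above (0.1 mg/kg to 100 mg/kg body weight/day) can be sketched as follows. This is only an illustrative calculation under the stated range; the function and constant names are hypothetical, and the sketch does not account for potency, route of administration, or the other factors listed above.

```python
from typing import Tuple

# Bounds of the general range stated in the text, in mg per kg body weight per day.
MIN_DOSE_MG_PER_KG_PER_DAY = 0.1
MAX_DOSE_MG_PER_KG_PER_DAY = 100.0


def daily_dose_range_mg(body_weight_kg: float) -> Tuple[float, float]:
    """Return the (lower, upper) total daily amount in mg for a given body weight."""
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    lower = MIN_DOSE_MG_PER_KG_PER_DAY * body_weight_kg
    upper = MAX_DOSE_MG_PER_KG_PER_DAY * body_weight_kg
    return lower, upper
```

For a 70 kg subject, the stated range corresponds to roughly 7 mg to 7000 mg of active ingredient per day.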

The nucleic acid molecules of the invention and formulations thereof can be
administered orally, topically, parenterally, by inhalation or spray, or rectally in dosage unit
formulations containing conventional non-toxic pharmaceutically acceptable carriers,
adjuvants and/or vehicles. The term parenteral as used herein includes percutaneous,
subcutaneous, intravascular (e.g., intravenous), intramuscular, or intrathecal injection or
infusion techniques and the like. In addition, there is provided a pharmaceutical formulation
comprising a nucleic acid molecule of the invention and a pharmaceutically acceptable
carrier. One or more nucleic acid molecules of the invention can be present in association
with one or more non-toxic pharmaceutically acceptable carriers and/or diluents and/or
adjuvants, and if desired other active ingredients. The pharmaceutical compositions
containing nucleic acid molecules of the invention can be in a form suitable for oral use, for
example, as tablets, troches, lozenges, aqueous or oily suspensions, dispersible powders or
granules, emulsion, hard or soft capsules, or syrups or elixirs.

Compositions intended for oral use can be prepared according to any method known to
the art for the manufacture of pharmaceutical compositions and such compositions can
contain one or more such sweetening agents, flavoring agents, coloring agents or preservative
agents in order to provide pharmaceutically elegant and palatable preparations. Tablets
contain the active ingredient in admixture with non-toxic pharmaceutically acceptable
excipients that are suitable for the manufacture of tablets. These excipients can be, for
example, inert diluents, such as calcium carbonate, sodium carbonate, lactose, calcium
phosphate or sodium phosphate; granulating and disintegrating agents, for example, corn
starch, or alginic acid; binding agents, for example starch, gelatin or acacia; and lubricating
agents, for example magnesium stearate, stearic acid or talc. The tablets can be uncoated or
they can be coated by known techniques. In some cases such coatings can be prepared by
known techniques to delay disintegration and absorption in the gastrointestinal tract and
thereby provide a sustained action over a longer period. For example, a time delay material
such as glyceryl monostearate or glyceryl distearate can be employed.

Formulations for oral use can also be presented as hard gelatin capsules wherein the
active ingredient is mixed with an inert solid diluent, for example, calcium carbonate, calcium
phosphate or kaolin, or as soft gelatin capsules wherein the active ingredient is mixed with
water or an oil medium, for example peanut oil, liquid paraffin or olive oil.

Aqueous suspensions contain the active materials in admixture with excipients suitable
for the manufacture of aqueous suspensions. Such excipients are suspending agents, for
example, sodium carboxymethylcellulose, methylcellulose, hydroxypropyl-methylcellulose,
sodium alginate, polyvinylpyrrolidone, gum tragacanth and gum acacia; dispersing or wetting
agents can be a naturally-occurring phosphatide, for example, lecithin, or condensation
products of an alkylene oxide with fatty acids, for example polyoxyethylene stearate, or
condensation products of ethylene oxide with long chain aliphatic alcohols, for example
heptadecaethyleneoxycetanol, or condensation products of ethylene oxide with partial esters
derived from fatty acids and a hexitol such as polyoxyethylene sorbitol monooleate, or
condensation products of ethylene oxide with partial esters derived from fatty acids and
hexitol anhydrides, for example polyethylene sorbitan monooleate. The aqueous suspensions
can also contain one or more preservatives, for example, ethyl, or n-propyl p-
hydroxybenzoate, one or more coloring agents, one or more flavoring agents, and one or more
sweetening agents, such as sucrose or saccharin.

Oily suspensions can be formulated by suspending the active ingredients in a vegetable 
oil, for example arachis oil, olive oil, sesame oil or coconut oil, or in a mineral oil such as
liquid paraffin. The oily suspensions can contain a thickening agent, for example beeswax, 
hard paraffin or cetyl alcohol. Sweetening agents and flavoring agents can be added to 
provide palatable oral preparations. These compositions can be preserved by the addition of 
an anti-oxidant such as ascorbic acid. 

Dispersible powders and granules suitable for preparation of an aqueous suspension by
the addition of water provide the active ingredient in admixture with a dispersing or wetting
agent, suspending agent and one or more preservatives. Suitable dispersing or wetting agents 
or suspending agents are exemplified by those already mentioned above. Additional 
excipients, for example sweetening, flavoring and coloring agents, can also be present. 

Pharmaceutical compositions of the invention can also be in the form of oil-in-water
emulsions. The oily phase can be a vegetable oil or a mineral oil or mixtures of these.
Suitable emulsifying agents can be naturally-occurring gums, for example gum acacia or gum
tragacanth, naturally-occurring phosphatides, for example soy bean lecithin, and esters or
partial esters derived from fatty acids and hexitol anhydrides, for example, sorbitan
monooleate, and condensation products of the said partial esters with ethylene oxide, for
example polyoxyethylene sorbitan monooleate. The emulsions can also contain sweetening
and flavoring agents.

Syrups and elixirs can be formulated with sweetening agents, for example glycerol,
propylene glycol, sorbitol, glucose or sucrose. Such formulations can also contain a
demulcent, a preservative and flavoring and coloring agents. The pharmaceutical
compositions can be in the form of a sterile injectable aqueous or oleaginous suspension.
This suspension can be formulated according to the known art using those suitable dispersing
or wetting agents and suspending agents that have been mentioned above. The sterile
injectable preparation can also be a sterile injectable solution or suspension in a non-toxic
parenterally acceptable diluent or solvent, for example as a solution in 1,3-butanediol. Among
the acceptable vehicles and solvents that can be employed are water, Ringer's solution and
isotonic sodium chloride solution. In addition, sterile, fixed oils are conventionally employed
as a solvent or suspending medium. For this purpose any bland fixed oil can be employed
including synthetic mono- or diglycerides. In addition, fatty acids such as oleic acid find use
in the preparation of injectables.

The nucleic acid molecules of the invention can also be administered in the form of 
suppositories, e.g., for rectal administration of the drug. These compositions can be prepared 
by mixing the drug with a suitable non-irritating excipient that is solid at ordinary
temperatures but liquid at the rectal temperature and will therefore melt in the rectum to 
release the drug. Such materials include cocoa butter and polyethylene glycols. 

Nucleic acid molecules of the invention can be administered parenterally in a sterile 
medium. The drug, depending on the vehicle and concentration used, can either be suspended 
or dissolved in the vehicle. Advantageously, adjuvants such as local anesthetics,
preservatives and buffering agents can be dissolved in the vehicle. 

Dosage levels of the order of from about 0.1 mg to about 140 mg per kilogram of body
weight per day are useful in the treatment of the above-indicated conditions (about 0.5 mg to
about 7 g per patient or subject per day). The amount of active ingredient that can be
combined with the carrier materials to produce a single dosage form varies depending upon
the host treated and the particular mode of administration. Dosage unit forms generally
contain from about 1 mg to about 500 mg of an active ingredient.

It is understood that the specific dose level for any particular patient or subject depends
upon a variety of factors including the activity of the specific compound employed, the age,
body weight, general health, sex, diet, time of administration, route of administration, rate
of excretion, drug combination and the severity of the particular disease undergoing therapy.
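
As a purely arithmetic illustration of the per-kilogram range stated above, the following sketch converts that range into a daily amount for a subject of a given body weight. The function name, its parameters and the 70 kg example weight are assumptions introduced here for illustration; the sketch is not a dosing recommendation and does not account for any of the patient-specific factors listed above.

```python
def daily_dose_range_mg(body_weight_kg, low_mg_per_kg=0.1, high_mg_per_kg=140.0):
    """Return (low, high) daily amounts in mg for one subject.

    The default bounds are the per-kilogram range quoted in the text;
    body_weight_kg is a hypothetical input supplied by the caller.
    """
    if body_weight_kg <= 0:
        raise ValueError("body weight must be positive")
    return (low_mg_per_kg * body_weight_kg, high_mg_per_kg * body_weight_kg)


# Hypothetical 70 kg subject: roughly 7 mg to 9.8 g per day.
low_mg, high_mg = daily_dose_range_mg(70)
print(f"{low_mg:.1f} mg to {high_mg / 1000:.1f} g per day")
```

The per-patient figures given in parentheses in the text are approximate, so a calculation of this kind will not reproduce them exactly for every body weight.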

For administration to non-human animals, the composition can also be added to the
animal feed or drinking water. It can be convenient to formulate the animal feed and drinking
water compositions so that the animal takes in a therapeutically appropriate quantity of the
composition along with its diet. It can also be convenient to present the composition as a
premix for addition to the feed or drinking water.

The nucleic acid molecules of the present invention can also be administered to a
patient or subject in combination with other therapeutic compounds to increase the overall
therapeutic effect. The use of multiple compounds to treat an indication can increase the 
beneficial effects while reducing the presence of side effects. 

In another aspect of the invention, nucleic acid molecules of the present invention are
preferably expressed from transcription units (see for example Couture et al., 1996, TIG., 12,
510, Skillern et al., International PCT Publication No. WO 00/22113, Conrad, International
PCT Publication No. WO 00/22114, and Conrad, US 6,054,299) inserted into DNA or RNA
vectors. The recombinant vectors are preferably DNA plasmids or viral vectors. Enzymatic
nucleic acid expressing viral vectors can be constructed based on, but not limited to,
adeno-associated virus, retrovirus, adenovirus, or alphavirus. Preferably, the recombinant vectors
capable of expressing the nucleic acid molecules are delivered as described above, and persist
in target cells. Alternatively, viral vectors can be used that provide for transient expression of
nucleic acid molecules. Such vectors can be repeatedly administered as necessary. Once
expressed, the nucleic acid molecule binds to the target mRNA. Delivery of nucleic acid
molecule expressing vectors can be systemic, such as by intravenous or intra-muscular
administration, by administration to target cells explanted from the subject followed by
reintroduction into the subject, or by any other means that would allow for introduction into
the desired target cell (for a review see Couture et al., 1996, TIG., 12, 510).

One aspect of the invention features an expression vector comprising a nucleic acid
sequence encoding at least one of the nucleic acid molecules of the instant invention. The
nucleic acid sequence encoding the nucleic acid molecule of the instant invention is operably
linked in a manner that allows expression of that nucleic acid molecule.

Another aspect of the invention features an expression vector comprising a nucleic acid
sequence encoding at least one of the nucleic acid molecules of the invention, in a manner
which allows expression of that nucleic acid molecule. The expression vector comprises in
one embodiment: a) a transcription initiation region; b) a transcription termination region; c)
a nucleic acid sequence encoding at least one said nucleic acid molecule; and wherein said
sequence is operably linked to said initiation region and said termination region, in a manner
that allows expression and/or delivery of said nucleic acid molecule.

In another embodiment, the expression vector comprises: a) a transcription initiation 
region; b) a transcription termination region; c) an open reading frame; d) a nucleic acid 
sequence encoding at least one said nucleic acid molecule, wherein said sequence is operably 
linked to the 3'-end of said open reading frame; and wherein said sequence is operably linked
to said initiation region, said open reading frame and said termination region, in a manner 
which allows expression and/or delivery of said nucleic acid molecule. In yet another 
embodiment the expression vector comprises: a) a transcription initiation region; b) a 
transcription termination region; c) an intron; d) a nucleic acid sequence encoding at least one 
said nucleic acid molecule; and wherein said sequence is operably linked to said initiation 
region, said intron and said termination region, in a manner which allows expression and/or 
delivery of said nucleic acid molecule. 

In another embodiment, the expression vector comprises: a) a transcription initiation 
region; b) a transcription termination region; c) an intron; d) an open reading frame; e) a 
nucleic acid sequence encoding at least one said nucleic acid molecule, wherein said sequence 
is operably linked to the 3'-end of said open reading frame; and wherein said sequence is
operably linked to said initiation region, said intron, said open reading frame and said 
termination region, in a manner which allows expression and/or delivery of said nucleic acid 
molecule. 

Examples 

The following are non-limiting examples showing the selection, isolation, synthesis and 
activity of nucleic acids of the instant invention. 

Example 1: Identification of Potential Target Sites in Human Ras RNA

The sequences of human Ras genes were screened for accessible sites using a
computer-folding algorithm. Regions of the RNA that do not form secondary folding structures and
contain potential enzymatic nucleic acid molecule and/or antisense binding/cleavage sites
were identified. The sequences of K-Ras and H-Ras binding/cleavage sites are shown in
Tables II and III.

Example 2: Selection of Enzymatic Nucleic Acid Cleavage Sites in Human Ras RNA

Enzymatic nucleic acid molecule target sites were chosen by analyzing sequences of
Human K-Ras and H-Ras (for example, Genbank accession Nos: NM_004985 and
NM_005343 respectively) and prioritizing the sites on the basis of folding. Enzymatic
nucleic acid molecules were designed that can bind each target and were individually
analyzed by computer folding (Christoffersen et al., 1994 J. Mol. Struc. Theochem, 311, 273;
Jaeger et al., 1989, Proc. Natl. Acad. Sci. USA, 86, 7706) to assess whether the enzymatic
nucleic acid molecule sequences fold into the appropriate secondary structure. Those
enzymatic nucleic acid molecules with unfavorable intramolecular interactions between the
binding arms and the catalytic core were eliminated from consideration. As noted below,
varying binding arm lengths can be chosen to optimize activity. Generally, at least 5 bases on
each arm are able to bind to, or otherwise interact with, the target RNA.

Example 3: Chemical Synthesis and Purification of Enzymatic Nucleic Acid Molecules for
Efficient Cleavage and/or blocking of Ras RNA

DNAzyme molecules are designed to anneal to various sites in the RNA message. The
binding arms of the DNAzyme molecules are complementary to the target site sequences
described above. The DNAzymes were chemically synthesized. The method of synthesis
used followed the procedure for nucleic acid synthesis as described herein and in Usman et
al., (1987 J. Am. Chem. Soc., 109, 7845), Scaringe et al., (1990 Nucleic Acids Res., 18,
5433) and Wincott et al., supra, and made use of common nucleic acid protecting and
coupling groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end.
The average stepwise coupling yields were typically >98%. The sequences of the chemically
synthesized DNAzyme molecules used in this study are shown below in Tables II and III.
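
The statement that the binding arms are complementary to the target site can be sketched in code. The example below is a minimal illustration, not the procedure used to generate the Tables: it assumes a 10-23 motif DNAzyme with the commonly cited catalytic core GGCTAGCTACAACGA and a cleavage site consisting of an unpaired purine followed by a paired pyrimidine, neither of which is specified in this section. The function names, the default arm length of 7 and the target sequence are hypothetical.

```python
# Assumption: 10-23 motif catalytic core (not stated in this section).
CORE_10_23 = "GGCTAGCTACAACGA"

# DNA bases that pair with each RNA base of the target.
DNA_PARTNER = {"A": "T", "U": "A", "G": "C", "C": "G"}


def reverse_complement_dna(rna):
    """Return the DNA strand (5'->3') that base-pairs with an RNA segment."""
    return "".join(DNA_PARTNER[base] for base in reversed(rna))


def design_dnazyme(target_rna, cut_index, arm_length=7):
    """Build a DNAzyme (5'->3') around one cleavage site of a target RNA.

    cut_index is the 0-based position of the unpaired purine; the arms
    pair with the arm_length bases on either side of that position.
    """
    target = target_rna.upper().replace("T", "U")
    if cut_index - arm_length < 0 or cut_index + 1 + arm_length > len(target):
        raise ValueError("site too close to the end of the target for this arm length")
    if target[cut_index] not in "AG" or target[cut_index + 1] not in "CU":
        raise ValueError("expected a purine followed by a pyrimidine at the site")
    upstream = target[cut_index - arm_length:cut_index]
    downstream = target[cut_index + 1:cut_index + 1 + arm_length]
    # The 5' arm of the DNAzyme pairs with the 3' side of the target site.
    return reverse_complement_dna(downstream) + CORE_10_23 + reverse_complement_dna(upstream)


# Hypothetical 18-nt target, site at the G in position 8, 5-base arms.
print(design_dnazyme("GGCCUUAAGUCCGGAAUU", cut_index=8, arm_length=5))
```

Varying `arm_length` corresponds to the varying binding arm lengths mentioned in Example 2; candidate sequences would still need to be checked by folding analysis as described there.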

Example 4: DNAzyme Cleavage of Ras RNA Target in vitro

DNAzymes targeted to the human K-Ras and H-Ras RNA are designed and synthesized
as described above. These enzymatic nucleic acid molecules can be tested for cleavage
activity in vitro, for example, using the following procedure. The target sequences and the
nucleotide location within the K-Ras and H-Ras RNA are given in Tables II and III
respectively.

Cleavage Reactions:

DNAzymes and substrates were synthesized in 96-well format using 0.2 µmol scale.
Substrates were 5'-32P labeled and gel purified using 7.5% polyacrylamide gels, eluting
into water. Assays were done by combining trace substrate with 500 nM DNAzyme or
greater, and initiated by adding final concentrations of 40 mM Mg+2 and 50 mM Tris-Cl pH
8.0. For each DNAzyme/substrate combination a control reaction was done to ensure
cleavage was not the result of non-specific substrate degradation. A single three hour time
point was taken and run on a 15% polyacrylamide gel to assess cleavage activity. Gels were
dried and scanned using a Molecular Dynamics Phosphorimager and quantified using
Molecular Dynamics ImageQuant software. Percent cleaved was determined by dividing
values for cleaved substrate bands by full-length (uncleaved) values plus cleaved values and
multiplying by 100 (%cleaved=[C/(U+C)]*100).
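
The percent-cleaved calculation above can be written as a short function. This is a minimal sketch of the stated formula only; the function name and the example band intensities are hypothetical, and the values are assumed to be quantified band signals in arbitrary units.

```python
def percent_cleaved(cleaved, uncleaved):
    """Compute %cleaved = [C / (U + C)] * 100 from quantified band values.

    cleaved   -- summed signal of the cleaved substrate bands (C)
    uncleaved -- signal of the full-length substrate band (U)
    """
    if cleaved < 0 or uncleaved < 0:
        raise ValueError("band values must be non-negative")
    total = cleaved + uncleaved
    if total == 0:
        raise ValueError("no signal in either band")
    return cleaved / total * 100.0


# Hypothetical band intensities in arbitrary units.
print(percent_cleaved(cleaved=25.0, uncleaved=75.0))
```

Applying the same function to the matching control reaction gives a direct check that the observed cleavage is not non-specific substrate degradation.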

Example 5: DNAzyme Cleavage of Ras RNA Target in vivo
Cell Culture

Wickstrom, 2001, Mol. Biotechnol., 18, 35-35, describes a cell culture system in which
antisense oligonucleotides targeting H-Ras were studied in transformed mouse cells that form
solid tumors. Treatment of cells with antisense targeting H-Ras resulted in the sequence
specific and dose dependent inhibition of H-Ras expression. In this study, it was determined
that antisense oligonucleotides targeting the first intron region of H-Ras were more effective
than antisense oligonucleotides targeting the initiation codon region.

Kita et al., 1999, Int. J. Cancer, 80, 553-558, describes the growth inhibition of human
pancreatic cancer cell lines by antisense oligonucleotides specific to mutated K-Ras genes.
Antisense oligonucleotides were transfected to the transformed cells using liposomes.
Cellular proliferation, K-Ras mRNA expression, and K-Ras protein synthesis were all
evaluated as endpoints. Sato et al., 2000, Cancer Lett., 155, 153-161, describes another
human pancreatic cancer cell line, HOR-P1, that is characterized by high angiogenic activity
and metastatic potential. Genetic and molecular analysis of this cell line revealed both
increased telomerase activity and a mutation in the K-Ras oncogene.

A variety of endpoints have been used in cell culture models to look at Ras-mediated
effects after treatment with anti-Ras agents. Phenotypic endpoints include inhibition of cell
proliferation, RNA expression, and reduction of Ras protein expression. Because Ras
oncogene mutations are directly associated with increased proliferation of certain tumor cells,
a proliferation endpoint for cell culture assays is preferably used as the primary screen. There
are several methods by which this endpoint can be measured. Following treatment of cells
with DNAzymes, cells are allowed to grow (typically 5 days) after which either the cell
viability, the incorporation of [3H]thymidine into cellular DNA and/or the cell density can be
measured. The assay of cell density is done in a 96-well format using commercially available
fluorescent nucleic acid stains (such as Syto® 13 or CyQuant®). As a secondary,
confirmatory endpoint a DNAzyme-mediated decrease in the level of Ras protein expression
is evaluated using a Ras-specific ELISA.



Animal Models 

Evaluating the efficacy of anti-Ras agents in animal models is an important prerequisite
to human clinical trials. As in cell culture models, the most Ras sensitive mouse tumor
xenografts are those derived from cancer cells that express mutant Ras proteins. Nude mice
bearing H-Ras transformed bladder cancer cell xenografts were sensitive to an anti-Ras
antisense nucleic acid, resulting in an 80% inhibition of tumor growth after a 31 day treatment
period (Wickstrom, 2001, Mol. Biotechnol., 18, 35-35). Zhang et al., 2000, Gene Ther., 7,
2041, describes an anti-K-Ras ribozyme adenoviral vector (KRbz-ADV) targeting a K-Ras
mutant (K-Ras codon 12 GGT to GTT; H441 and H1725 cells respectively). Non-small cell
lung cancer cells (NSCLC H441 and H1725 cells) that express the mutant K-Ras protein were
used in nude mouse xenografts compared to NSCLC H1650 cells that lack the relevant
mutation. Pre-treatment with KRbz-ADV completely abrogated engraftment of both H441
and H1725 cells, compared to 100% engraftment and tumor growth in animals that
received untreated tumor cells or a control vector. The above studies provide proof that
inhibition of Ras expression by anti-Ras agents causes inhibition of tumor growth in animals.
Anti-Ras DNAzymes chosen from in vitro assays are further tested in similar mouse
xenograft models. Active DNAzymes are subsequently tested in combination with standard
chemotherapies.

Indications 

Particular degenerative and disease states that are associated with Ras expression 
modulation include but are not limited to cancer, for example lung cancer, colorectal cancer, 
bladder cancer, pancreatic cancer, breast cancer, prostate cancer and/or any other diseases or
conditions that are related to or will respond to the levels of Ras in a cell or tissue, alone or in
combination with other therapies.

The present body of knowledge in Ras research indicates the need for methods to assay
Ras activity and for compounds that can regulate Ras expression for research, diagnostic, and
therapeutic use.

The use of monoclonal antibodies, chemotherapy, radiation therapy, and analgesics are
all non-limiting examples of methods that can be combined with or used in conjunction with
the nucleic acid molecules (e.g. DNAzymes) of the instant invention. Common
chemotherapies that can be combined with nucleic acid molecules of the instant invention
include various combinations of cytotoxic drugs to kill cancer cells. These drugs include but
are not limited to paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, cyclophosphamide,
doxorubicin, fluorouracil, carboplatin, edatrexate, gemcitabine, vinorelbine etc. Those skilled
in the art will recognize that other drug compounds and therapies can similarly be readily
combined with the nucleic acid molecules of the instant invention (e.g. DNAzyme molecules)
and are hence within the scope of the instant invention.

Diagnostic uses 

The nucleic acid molecules of this invention (e.g., enzymatic nucleic acid molecules)
are used as diagnostic tools to examine genetic drift and mutations within diseased cells or to
detect the presence of Ras RNA in a cell. The close relationship between enzymatic nucleic
acid molecule activity and the structure of the target RNA allows the detection of mutations
in any region of the molecule that alters the base-pairing and three-dimensional structure of
the target RNA. Using multiple enzymatic nucleic acid molecules described in this invention,
one maps nucleotide changes which are important to RNA structure and function in vitro, as
well as in cells and tissues. Cleavage of target RNAs with enzymatic nucleic acid molecules
is used to inhibit gene expression and define the role (essentially) of specified gene products
in the progression of disease. In this manner, other genetic targets are defined as important
mediators of the disease. These experiments lead to better treatment of the disease
progression by affording the possibility of combinational therapies (e.g., multiple enzymatic
nucleic acid molecules targeted to different genes, enzymatic nucleic acid molecules coupled
with known small molecule inhibitors, or intermittent treatment with combinations of
enzymatic nucleic acid molecules and/or other chemical or biological molecules). Other in
vitro uses of enzymatic nucleic acid molecules of this invention are known in the art, and
include detection of the presence of mRNAs associated with a Ras-related condition. Such
RNA is detected by determining the presence of a cleavage product after treatment with an
enzymatic nucleic acid molecule using standard methodology.

In a specific example, enzymatic nucleic acid molecules that cleave only wild-type or
mutant forms of the target RNA are used for the assay. The first enzymatic nucleic acid
molecule is used to identify wild-type RNA present in the sample and the second enzymatic
nucleic acid molecule is used to identify mutant RNA in the sample. As reaction controls,
synthetic substrates of both wild-type and mutant RNA are cleaved by both enzymatic nucleic
acid molecules to demonstrate the relative enzymatic nucleic acid molecule efficiencies in the
reactions and the absence of cleavage of the "non-targeted" RNA species. The cleavage
products from the synthetic substrates also serve to generate size markers for the analysis of
wild-type and mutant RNAs in the sample population. Thus each analysis requires two
enzymatic nucleic acid molecules, two substrates and one unknown sample which is
combined into six reactions. The presence of cleavage products is determined using an
RNAse protection assay so that full-length and cleavage fragments of each RNA can be
analyzed in one lane of a polyacrylamide gel. It is not absolutely required to quantify the
results to gain insight into the expression of mutant RNAs and putative risk of the desired
phenotypic changes in target cells. The expression of mRNA whose protein product is
implicated in the development of the phenotype (i.e., Ras) is adequate to establish risk. If
probes of comparable specific activity are used for both transcripts, then a qualitative
comparison of RNA levels will be adequate and will decrease the cost of the initial diagnosis.
Higher mutant form to wild-type ratios are correlated with higher risk whether RNA levels
are compared qualitatively or quantitatively. The use of enzymatic nucleic acid molecules in
diagnostic applications contemplated by the instant invention is described, for example, in
George et al., US Patent Nos. 5,834,186 and 5,741,679, Shih et al., US Patent No. 5,589,332,
Nathan et al., US Patent No 5,871,914, Nathan and Ellington, International PCT publication
No. WO 00/24931, Breaker et al., International PCT Publication Nos. WO 00/26226 and
98/27104, and Sullenger et al., International PCT publication No. WO 99/29842.
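
The count of six reactions in the specific example above follows from pairing each of the two enzymatic nucleic acid molecules with each of three RNA sources (two synthetic control substrates plus the unknown sample). The sketch below simply enumerates that layout; the function name and the labels are hypothetical and chosen only for illustration.

```python
from itertools import product


def diagnostic_reactions(enzymes, substrates, sample):
    """Pair every enzymatic nucleic acid molecule with every RNA source."""
    rna_sources = list(substrates) + [sample]
    return list(product(enzymes, rna_sources))


reactions = diagnostic_reactions(
    enzymes=("wild-type-specific enzyme", "mutant-specific enzyme"),
    substrates=("synthetic wild-type RNA", "synthetic mutant RNA"),
    sample="unknown sample",
)

# Two enzymes x (two controls + one sample) = six reactions.
for number, (enzyme, rna) in enumerate(reactions, start=1):
    print(f"reaction {number}: {enzyme} + {rna}")
```

The four control pairings show both relative enzyme efficiency and the absence of cleavage of the non-targeted species, while the two pairings with the unknown sample report on wild-type and mutant RNA respectively.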

Example 6: Identification of Potential Target Sites in Human HIV RNA

The sequences of human HIV genes are screened for accessible sites using a
computer-folding algorithm. Regions of the RNA that do not form secondary folding structures and
contain potential enzymatic nucleic acid molecule and/or antisense binding/cleavage sites
are identified. The sequences of these binding/cleavage sites are shown in Tables VI to XI.

Example 6: Selection of Enzymatic Nucleic Acid Cleavage Sites in Human HIV RNA

Enzymatic nucleic acid molecule target sites were chosen by analyzing sequences of
Human HIV (Genbank accession No: NM_005228) and prioritizing the sites on the basis of
folding. Enzymatic nucleic acid molecules were designed that can bind each target and were
individually analyzed by computer folding (Christoffersen et al., 1994 J. Mol. Struc.
Theochem, 311, 273; Jaeger et al., 1989, Proc. Natl. Acad. Sci. USA, 86, 7706) to assess
whether the enzymatic nucleic acid molecule sequences fold into the appropriate secondary
structure. Those enzymatic nucleic acid molecules with unfavorable intramolecular
interactions between the binding arms and the catalytic core were eliminated from
consideration. As noted below, varying binding arm lengths can be chosen to optimize
activity. Generally, at least 5 bases on each arm are able to bind to, or otherwise interact
with, the target RNA.

Example 8: Chemical Synthesis and Purification of Ribozymes and Antisense for Efficient
Cleavage and/or blocking of HIV Activity

Enzymatic nucleic acid molecules and antisense constructs are designed to anneal to
various sites in the RNA message. The binding arms of the enzymatic nucleic acid molecules
are complementary to the target site sequences described above, while the antisense
constructs are fully complementary to the target site sequences described above. The
enzymatic nucleic acid molecules and antisense constructs were chemically synthesized. The
method of synthesis used followed the procedure for normal RNA synthesis as described
above and in Usman et al., (1987 J. Am. Chem. Soc., 109, 7845), Scaringe et al., (1990
Nucleic Acids Res., 18, 5433) and Wincott et al., supra, and made use of common nucleic
acid protecting and coupling groups, such as dimethoxytrityl at the 5'-end, and
phosphoramidites at the 3'-end. The average stepwise coupling yields were typically >98%.

Enzymatic nucleic acid molecules and antisense constructs are also synthesized from
DNA templates using bacteriophage T7 RNA polymerase (Milligan and Uhlenbeck, 1989,
Methods Enzymol. 180, 51). Enzymatic nucleic acid molecules and antisense constructs are
purified by gel electrophoresis using general methods or are purified by high pressure liquid
chromatography (HPLC; see Wincott et al., supra; the totality of which is hereby
incorporated herein by reference) and are resuspended in water. The sequences of the
chemically synthesized enzymatic nucleic acid molecules used in this study are shown below
in Table XI. The sequences of the chemically synthesized antisense constructs used in this
study are complementary sequences to the Substrate sequences shown below in Tables VI
to XI.

Example 8: Enzymatic nucleic acid molecule Cleavage of HIV RNA Target in vitro

Enzymatic nucleic acid molecules targeted to the human HIV RNA are designed and
synthesized as described above. These enzymatic nucleic acid molecules are tested for
cleavage activity in vitro, for example, using the following procedure. The target sequences
and the nucleotide location within the HIV RNA are given in Tables VI to XI.

Cleavage Reactions: Full-length or partially full-length, internally-labeled target RNA
for enzymatic nucleic acid molecule cleavage assay is prepared by in vitro transcription in the
presence of [α-32P]CTP, passed over a G 50 Sephadex column by spin chromatography and
used as substrate RNA without further purification. Alternately, substrates are 5'-32P-end
labeled using T4 polynucleotide kinase enzyme. Assays are performed by pre-warming a 2X
concentration of purified enzymatic nucleic acid molecule in enzymatic nucleic acid molecule
cleavage buffer (50 mM Tris-HCl, pH 7.5 at 37°C, 10 mM MgCl2) and the cleavage reaction
is initiated by adding the 2X enzymatic nucleic acid molecule mix to an equal volume of
substrate RNA (maximum of 1-5 nM) that is also pre-warmed in cleavage buffer. As an
initial screen, assays are carried out for 1 hour at 37°C using a final concentration of either 40
nM or 1 mM enzymatic nucleic acid molecule, i.e., enzymatic nucleic acid molecule excess.
The reaction is quenched by the addition of an equal volume of 95% formamide, 20 mM
EDTA, 0.05% bromophenol blue and 0.05% xylene cyanol after which the sample is heated
to 95°C for 2 minutes, quick chilled and loaded onto a denaturing polyacrylamide gel.
Substrate RNA and the specific RNA cleavage products generated by enzymatic nucleic acid
molecule cleavage are visualized on an autoradiograph of the gel. The percentage of cleavage
is determined by Phosphor Imager® quantitation of bands representing the intact substrate
and the cleavage products.
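
The reaction set-up above relies on a simple dilution: a 2X enzyme mix combined with an equal volume of substrate gives the 1X final enzyme concentration. The sketch below shows that arithmetic under the assumption that the substrate solution contains no enzyme; the function name, the 80 nM stock and the 10 µL volumes are hypothetical values chosen so that the result matches the 40 nM final concentration mentioned in the text.

```python
def final_concentration(stock_conc, stock_volume, added_volume):
    """Concentration after mixing a stock with a volume that lacks the solute.

    Any concentration unit may be used; the two volumes must share a unit.
    """
    if stock_volume <= 0 or added_volume < 0:
        raise ValueError("volumes must be positive")
    return stock_conc * stock_volume / (stock_volume + added_volume)


# Hypothetical: 10 uL of 2X enzyme mix (80 nM) plus 10 uL of substrate RNA.
print(final_concentration(stock_conc=80.0, stock_volume=10.0, added_volume=10.0))
```

Because both the enzyme mix and the substrate are prepared in cleavage buffer, the buffer components remain at their stated concentrations after mixing; only the enzyme and substrate are diluted two-fold.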

Indications 

Particular degenerative and disease states that can be associated with HIV expression
modulation include but are not limited to acquired immunodeficiency disease (AIDS) and
related diseases and conditions, including but not limited to Kaposi's sarcoma, lymphoma,
cervical cancer, squamous cell carcinoma, cardiac myopathy, rheumatic diseases, and
opportunistic infection, for example Pneumocystis carinii, Cytomegalovirus, Herpes simplex,
Mycobacteria, Cryptococcus, Toxoplasma, Progressive multifocal leucoencephalopathy
(Papovavirus), Mycobacteria, Aspergillus, Cryptococcus, Candida, Cryptosporidium, Isospora
belli, Microsporidia and any other diseases or conditions that are related to or will respond to
the levels of HIV in a cell or tissue, alone or in combination with other therapies.

The present body of knowledge in HIV research indicates the need for methods to assay
HIV activity and for compounds that can regulate HIV expression for research, diagnostic,
and therapeutic use.

The use of antiviral compounds, monoclonal antibodies, chemotherapy, radiation
therapy, analgesics, and/or anti-inflammatory compounds are all non-limiting examples of
methods that can be combined with or used in conjunction with the nucleic acid molecules
(e.g. ribozymes and antisense molecules) of the instant invention. Examples of antiviral
compounds that can be used in conjunction with the nucleic acid molecules of the invention
include but are not limited to AZT (also known as zidovudine or ZDV), ddC (zalcitabine), ddI
(dideoxyinosine), d4T (stavudine), and 3TC (lamivudine), Ribavirin, delavirdine (Rescriptor),
nevirapine (Viramune), efavirenz (Sustiva), ritonavir (Norvir), saquinavir (Invirase),
indinavir (Crixivan), amprenavir (Agenerase), nelfinavir (Viracept), and/or lopinavir
(Kaletra). Common chemotherapies that can be combined with nucleic acid molecules of the
instant invention include various combinations of cytotoxic drugs to kill cancer cells. These
drugs include but are not limited to paclitaxel (Taxol), docetaxel, cisplatin, methotrexate,
cyclophosphamide, doxorubicin, fluorouracil, carboplatin, edatrexate, gemcitabine, vinorelbine
etc. Those skilled in the art will recognize that other drug compounds and therapies can
similarly be readily combined with the nucleic acid molecules of the instant invention (e.g.
ribozymes and antisense molecules) and are hence within the scope of the instant invention.

Diagnostic uses

The nucleic acid molecules of this invention (e.g., enzymatic nucleic acid molecules)
are used as diagnostic tools to examine genetic drift and mutations within diseased cells or to
detect the presence of HIV RNA in a cell. The close relationship between enzymatic nucleic
acid molecule activity and the structure of the target RNA allows the detection of mutations
in any region of the molecule which alters the base-pairing and three-dimensional structure of
the target RNA. Using multiple enzymatic nucleic acid molecules described in this invention,
one maps nucleotide changes which are important to RNA structure and function in vitro, as
well as in cells and tissues. Cleavage of target RNAs with enzymatic nucleic acid molecules
is used to inhibit gene expression and define the role (essentially) of specified gene products
in the progression of disease. In this manner, other genetic targets are defined as important
mediators of the disease. These experiments lead to better treatment of the disease
progression by affording the possibility of combinational therapies (e.g., multiple enzymatic
nucleic acid molecules targeted to different genes, enzymatic nucleic acid molecules coupled
with known small molecule inhibitors, or intermittent treatment with combinations of
enzymatic nucleic acid molecules and/or other chemical or biological molecules). Other in
vitro uses of enzymatic nucleic acid molecules of this invention are well known in the art, and
include detection of the presence of mRNAs associated with an HIV-related condition. Such
RNA is detected by determining the presence of a cleavage product after treatment with an
enzymatic nucleic acid molecule using standard methodology.

In a specific example, enzymatic nucleic acid molecules which cleave only wild-type or
mutant forms of the target RNA are used for the assay. The first enzymatic nucleic acid
molecule is used to identify wild-type RNA present in the sample and the second enzymatic
nucleic acid molecule is used to identify mutant RNA in the sample. As reaction controls,
synthetic substrates of both wild-type and mutant RNA are cleaved by both enzymatic nucleic
acid molecules to demonstrate the relative enzymatic nucleic acid molecule efficiencies in the
reactions and the absence of cleavage of the "non-targeted" RNA species. The cleavage
products from the synthetic substrates also serve to generate size markers for the analysis of
wild-type and mutant RNAs in the sample population. Thus each analysis requires two
enzymatic nucleic acid molecules, two substrates and one unknown sample, which are
combined into six reactions. The presence of cleavage products is determined using an
RNAse protection assay so that full-length and cleavage fragments of each RNA can be
analyzed in one lane of a polyacrylamide gel. It is not absolutely required to quantify the
results to gain insight into the expression of mutant RNAs and putative risk of the desired
phenotypic changes in target cells. The expression of mRNA whose protein product is
implicated in the development of the phenotype (i.e., HIV) is adequate to establish risk. If
probes of comparable specific activity are used for both transcripts, then a qualitative
comparison of RNA levels will be adequate and will decrease the cost of the initial diagnosis.
Higher mutant form to wild-type ratios are correlated with higher risk whether RNA levels
are compared qualitatively or quantitatively. The use of enzymatic nucleic acid molecules in
diagnostic applications contemplated by the instant invention is more fully described in
George et al., US Patent Nos. 5,834,186 and 5,741,679, Shih et al., US Patent No. 5,589,332,
Nathan et al., US Patent No. 5,871,914, Nathan and Ellington, International PCT publication
No. WO 00/24931, Breaker et al., International PCT Publication Nos. WO 00/26226 and
WO 98/27104, and Sullenger et al., International PCT publication No. WO 99/29842.
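
The six-reaction layout described above (two enzymatic nucleic acid molecules, each
combined with two synthetic substrates and one unknown sample) can be enumerated with a
short sketch. The labels and the function name reaction_plan are hypothetical and serve
only to illustrate the bookkeeping; the specification names no particular molecules.

```python
from itertools import product

# Hypothetical labels; the specification does not name specific molecules.
ENZYMES = ("wild-type-specific enzyme", "mutant-specific enzyme")
TARGETS = (
    "synthetic wild-type substrate",
    "synthetic mutant substrate",
    "unknown sample",
)


def reaction_plan(enzymes=ENZYMES, targets=TARGETS):
    """Pair every enzyme with every target and tag each pairing.

    Pairings with a synthetic substrate are reaction controls; pairings
    with the unknown sample are the actual test reactions.
    """
    plan = []
    for enzyme, target in product(enzymes, targets):
        role = "control" if target.startswith("synthetic") else "test"
        plan.append((enzyme, target, role))
    return plan


if __name__ == "__main__":
    for number, (enzyme, target, role) in enumerate(reaction_plan(), start=1):
        print(f"Reaction {number}: {enzyme} + {target} ({role})")
```

Two enzymes times three targets gives the six reactions stated in the text, four of
which are controls on synthetic substrates.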

Example 10: Identification of Potential Target Sites in Human HER2 RNA

The sequences of human HER2 genes were screened for accessible sites using a
computer-folding algorithm. Regions of the RNA that did not form secondary folding
structures and contained potential enzymatic nucleic acid molecule and/or antisense
binding/cleavage sites were identified. The sequences of these binding/cleavage sites are
shown in Tables IV and V.

Example 11: Selection of Enzymatic Nucleic Acid Cleavage Sites in Human HER2 RNA

Enzymatic nucleic acid molecule target sites were chosen by analyzing sequences of
Human HER2 (Genbank accession No: X03363) and prioritizing the sites on the basis of
folding. Enzymatic nucleic acid molecules were designed that can bind each target and were
individually analyzed by computer folding (Christoffersen et al., 1994 J. Mol. Struc.
Theochem, 311, 273; Jaeger et al., 1989, Proc. Natl. Acad. Sci. USA, 86, 7706) to assess
whether the enzymatic nucleic acid molecule sequences fold into the appropriate secondary
structure. Those enzymatic nucleic acid molecules with unfavorable intramolecular
interactions between the binding arms and the catalytic core were eliminated from
consideration. As noted below, variable binding arm lengths are chosen to optimize activity.
Generally, at least 5 bases on each arm are able to bind to, or otherwise interact with, the
target RNA.

Example 12: Chemical Synthesis and Purification of Ribozymes and Antisense for Efficient
Cleavage and/or Blocking of HER2 Expression

DNAzyme molecules are designed to anneal to various sites in the RNA message. The
binding arms of the DNAzyme molecules are complementary to the target site sequences
described above. The DNAzymes were chemically synthesized. The method of synthesis
used followed the procedure for nucleic acid synthesis as described above and in Usman et
al., (1987 J. Am. Chem. Soc., 109, 7845), Scaringe et al., (1990 Nucleic Acids Res., 18,
5433) and Wincott et al., supra, and made use of common nucleic acid protecting and
coupling groups, such as dimethoxytrityl at the 5'-end, and phosphoramidites at the 3'-end.
The average stepwise coupling yields were typically >98%. The sequences of the chemically
synthesized DNAzyme molecules used in this study are shown below in Table V.

Example 13: DNAzyme Cleavage of HER2 RNA Target in vitro

DNAzymes targeted to the human HER2 RNA are designed and synthesized as
described above. These enzymatic nucleic acid molecules can be tested for cleavage activity
in vitro, for example, using the following procedure. The target sequences and the nucleotide
location within the HER2 RNA are given in Tables IV and V.

Cleavage Reactions:

Ribozymes and substrates were synthesized in 96-well format using 0.2 µmol scale.
Substrates were 5'-32P labeled and gel purified using 7.5% polyacrylamide gels, eluting
into water. Assays were done by combining trace substrate with 500 nM Ribozyme or greater,
and initiated by adding final concentrations of 40 mM Mg2+ and 50 mM Tris-Cl pH 8.0. For
each ribozyme/substrate combination a control reaction was done to ensure cleavage was not
the result of non-specific substrate degradation. A single three hour time point was taken and
run on a 15% polyacrylamide gel to assess cleavage activity. Gels were dried and scanned
using a Molecular Dynamics Phosphorimager and quantified using Molecular Dynamics
ImageQuant software. Percent cleaved was determined by dividing values for cleaved
substrate bands by full-length (uncleaved) values plus cleaved values and multiplying by 100
(%cleaved = [C/(U+C)] * 100).
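
The percent-cleaved calculation above can be written out as a minimal sketch. The
function name and the input checks are illustrative assumptions; only the formula
[C/(U+C)] * 100 comes from the text, with C and U taken to be the quantified band
values for cleaved and full-length (uncleaved) substrate.

```python
def percent_cleaved(cleaved: float, uncleaved: float) -> float:
    """Return [C / (U + C)] * 100 for quantified gel band values.

    cleaved   -- summed signal of the cleaved substrate band(s), C
    uncleaved -- signal of the full-length substrate band, U
    """
    if cleaved < 0 or uncleaved < 0:
        raise ValueError("band values must be non-negative")
    total = cleaved + uncleaved
    if total == 0:
        raise ValueError("no signal: C + U must be greater than zero")
    return 100.0 * cleaved / total


if __name__ == "__main__":
    # Hypothetical band values in arbitrary phosphorimager units.
    print(percent_cleaved(cleaved=25.0, uncleaved=75.0))  # prints 25.0
```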

Example 14: DNAzyme Cleavage of HER2 RNA Target in vivo.
Cell Culture Review

The greatest HER2-specific effects have been observed in cancer cell lines that express
high levels of HER2 protein (as measured by ELISA). Specifically, in one study that treated
five human breast cancer cell lines with the HER2 antibody (anti-erbB2-sFv), the greatest
inhibition of cell growth was seen in three cell lines (MDA-MB-361, SKBR-3 and BT-474)
that express high levels of HER2 protein. No inhibition of cell growth was observed in two
cell lines (MDA-MB-231 and MCF-7) that express low levels of HER2 protein (Wright, M.,
Grim, J., Deshane, J., Kim, M., Strong, T.V., Siegel, G.P., Curiel, D.T. (1997) An
intracellular anti-erbB-2 single-chain antibody is specifically cytotoxic to human breast
carcinoma cells overexpressing erbB-2. Gene Therapy 4: 317-322). Another group
successfully used SKBR-3 cells to show HER2 antisense oligonucleotide-mediated inhibition
of HER2 protein expression and HER2 RNA knockdown (Vaughn, J.P., Iglehart, J.D.,
Demirdji, S., Davis, P., Babiss, L.E., Caruthers, M.H., Marks, J.R. (1995) Antisense DNA
downregulation of the ERBB2 oncogene measured by a flow cytometric assay. Proc Natl
Acad Sci USA 92: 8338-8342). Other groups have also demonstrated a decrease in the levels
of HER2 protein, HER2 mRNA and/or cell proliferation in cultured cells using anti-HER2
DNAzymes or antisense molecules (Suzuki T., Curcio, L.D., Tsai, J. and Kashani-Sabet M.
(1997) Anti-c-erbB-2 Ribozyme for Breast Cancer. In Methods in Molecular Medicine, Vol.
11, Therapeutic Applications of Ribozymes, Humana Press, Inc., Totowa, NJ; Weichen, K.,
Zimmer, C. and Dietel, M. (1997) Selection of a high activity c-erbB-2 ribozyme using a
fusion gene of c-erbB-2 and the enhanced green fluorescent protein. Cancer Gene Therapy 5:
45-51; Czubayko, F., Downing, S.G., Hsieh, S.S., Goldstein, D.J., Lu P.Y., Trapnell, B.C.
and Wellstein, A. (1997) Adenovirus-mediated transduction of ribozymes abrogates
HER-2/neu and pleiotrophin expression and inhibits tumor cell proliferation. Gene Ther. 4:
943-949; Colomer, R., Lupu, R., Bacus, S.S. and Gelmann, E.P. (1994) erbB-2 antisense
oligonucleotides inhibit the proliferation of breast carcinoma cells with erbB-2 oncogene
amplification. British J. Cancer 70: 819-825; Bertram et al., 1994). Because cell lines that
express higher levels of HER2 have been more sensitive to anti-HER2 agents, we prefer using
several medium to high expressing cell lines, including SKBR-3 and T47D, for DNAzyme
screens in cell culture.

A variety of endpoints have been used in cell culture models to look at HER2-mediated
effects after treatment with anti-HER2 agents. Phenotypic endpoints include inhibition of cell
proliferation, apoptosis assays and reduction of HER2 protein expression. Because
overexpression of HER2 is directly associated with increased proliferation of breast and
ovarian tumor cells, a proliferation endpoint for cell culture assays will preferably be used as
the primary screen. There are several methods by which this endpoint can be measured.
Following treatment of cells with DNAzymes, cells are allowed to grow (typically 5 days)
after which either the cell viability, the incorporation of [3H] thymidine into cellular DNA
and/or the cell density can be measured. The assay of cell density is very straightforward and
can be done in a 96-well format using commercially available fluorescent nucleic acid stains
(such as Syto® 13 or CyQuant®). The assay using CyQuant® is described herein and is
currently being employed to screen ~100 DNAzymes targeting HER2 (details below).

As a secondary, confirmatory endpoint a DNAzyme-mediated decrease in the level of
HER2 protein expression can be evaluated using a HER2-specific ELISA.

Validation of Cell Lines and DNAzyme Treatment Conditions

Two human breast cancer cell lines (T47D and SKBR-3) that are known to express
medium to high levels of HER2 protein, respectively, are considered for DNAzyme
screening. In order to validate these cell lines for HER2-mediated sensitivity, both cell lines
are treated with the HER2 specific antibody, Herceptin® (Genentech) and its effect on cell
proliferation is determined. Herceptin® is added to cells at concentrations ranging from 0-8
µM in medium containing either no serum (OptiMem), 0.1% or 0.5% FBS and efficacy is
determined via cell proliferation. Maximal inhibition of proliferation (~50%) in both cell
lines is typically observed after addition of Herceptin® at 0.5 nM in medium containing 0.1%
or no FBS. The fact that both cell lines are sensitive to an anti-HER2 agent (Herceptin®)
supports their use in experiments testing anti-HER2 DNAzymes.

Prior to DNAzyme screening, the choice of the optimal lipid(s) and conditions for 
DNAzyme delivery is determined empirically for each cell line. Applicant has established a 
panel of cationic lipids (lipids as described in PCT application WO99/05094) that can be used 
to deliver DNAzymes to cultured cells and are very useful for cell proliferation assays that are 
typically 3-5 days in length. (Additional description of useful lipids is provided above, and 
those skilled in the art are also familiar with a variety of lipids that can be used for delivery of
oligonucleotides to cells in culture.) Initially, this panel of lipid delivery vehicles is screened
in SKBR-3 and T47D cells using previously established control oligonucleotides. Specific 
lipids and conditions for optimal delivery are selected for each cell line based on these 
screens. These conditions are used to deliver HER2 specific DNAzymes to cells for primary 
(inhibition of cell proliferation) and secondary (decrease in HER2 protein) efficacy endpoints. 

Primary Screen: Inhibition of Cell Proliferation 

DNAzyme screens are performed using an automated, high throughput 96-well cell 
proliferation assay. Cell proliferation is measured over a 5-day treatment period using the 
nucleic acid stain CyQuant® for determining cell density. The growth of cells treated with 
DNAzyme/lipid complexes is compared to both untreated cells and to cells treated with 
Scrambled-arm Attenuated core Controls (SACs). SACs can no longer bind to the target site
due to the scrambled arm sequence and have nucleotide changes in the core that greatly
diminish DNAzyme cleavage. These SACs are used to determine non-specific inhibition of cell
growth caused by DNAzyme chemistry (i.e. multiple 2'-O-Me modified nucleotides and a 3'
inverted abasic). Lead DNAzymes are chosen from the primary screen based on their ability
to inhibit cell proliferation in a specific manner. Dose response assays are carried out on
these leads and a subset is advanced into a secondary screen using the level of HER2
protein as an endpoint.
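
The comparison described for the primary screen (DNAzyme-treated wells against both
untreated wells and SAC-treated wells) can be illustrated with a small sketch. The
function names, the use of mean fluorescence per well set, and the example readings are
assumptions for illustration; the text does not specify how the comparison is computed.

```python
from statistics import mean


def percent_inhibition(treated, reference):
    """Percent reduction of mean signal in treated wells relative to reference wells."""
    reference_mean = mean(reference)
    if reference_mean <= 0:
        raise ValueError("reference signal must be positive")
    return 100.0 * (1.0 - mean(treated) / reference_mean)


def summarize_screen(dnazyme_wells, sac_wells, untreated_wells):
    """Summarize one DNAzyme against its two controls.

    total       -- DNAzyme vs. untreated cells
    nonspecific -- SAC vs. untreated cells (effect of chemistry and delivery)
    specific    -- DNAzyme vs. SAC (effect attributable to target binding/cleavage)
    """
    return {
        "total": percent_inhibition(dnazyme_wells, untreated_wells),
        "nonspecific": percent_inhibition(sac_wells, untreated_wells),
        "specific": percent_inhibition(dnazyme_wells, sac_wells),
    }


if __name__ == "__main__":
    # Hypothetical fluorescence readings (arbitrary units) after 5 days.
    result = summarize_screen(
        dnazyme_wells=[50, 50], sac_wells=[80, 80], untreated_wells=[100, 100]
    )
    for name, value in result.items():
        print(f"{name}: {value:.1f}%")
```

Reporting inhibition relative to the SAC wells separately from inhibition relative to
untreated wells keeps the non-specific contribution visible when leads are ranked.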

Secondary Screen: Decrease in HER2 Protein and/or RNA

A secondary screen that measures the effect of anti-HER2 DNAzymes on HER2 protein 
and/or RNA levels is used to affirm preliminary findings. A robust HER2 ELISA for both 
T47D and SKBR-3 cells has been established and is available for use as an additional 
endpoint. In addition, a real time RT-PCR assay (TaqMan assay) has been developed to 
assess HER2 RNA reduction compared to an actin RNA control. Dose response activity of
nucleic acid molecules of the instant invention can be used to assess both HER2 protein and
RNA reduction endpoints.

DNAzyme Mechanism Assays

A TaqMan® assay for measuring the DNAzyme-mediated decrease in HER2 RNA has
also been established. This assay is based on PCR technology and can measure in real time
the production of HER2 mRNA relative to a standard cellular mRNA such as GAPDH. This
RNA assay is used to establish proof that lead DNAzymes are working through an RNA
cleavage mechanism and result in a decrease in the level of HER2 mRNA, thus leading to a
decrease in cell surface HER2 protein receptors and a subsequent decrease in tumor cell
proliferation.
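
One common way to express a target mRNA relative to a reference mRNA from real-time PCR
data is the comparative threshold-cycle (2^-ddCt) calculation, sketched below. This is an
assumption for illustration: the text states only that HER2 mRNA is measured relative to a
standard cellular mRNA such as GAPDH and does not say which calculation is used. The
sketch also assumes roughly 100% amplification efficiency for both amplicons, and all Ct
values shown are hypothetical.

```python
def relative_expression(
    ct_target_treated: float,
    ct_reference_treated: float,
    ct_target_control: float,
    ct_reference_control: float,
) -> float:
    """Fold change of target mRNA in treated vs. control cells (2^-ddCt).

    Each Ct is a threshold cycle; the reference transcript (e.g. GAPDH)
    normalizes for differences in RNA input between samples.
    """
    delta_treated = ct_target_treated - ct_reference_treated
    delta_control = ct_target_control - ct_reference_control
    return 2.0 ** -(delta_treated - delta_control)


def percent_knockdown(fold_change: float) -> float:
    """Convert a fold change (1.0 = no change) to percent reduction."""
    return 100.0 * (1.0 - fold_change)


if __name__ == "__main__":
    # Hypothetical Ct values: target mRNA appears two cycles later after treatment.
    fold = relative_expression(26.0, 18.0, 24.0, 18.0)
    print(fold, percent_knockdown(fold))  # prints 0.25 75.0
```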

Animal Models 

Evaluating the efficacy of anti-HER2 agents in animal models is an important
prerequisite to human clinical trials. As in cell culture models, the most HER2 sensitive
mouse tumor xenografts are those derived from human breast carcinoma cells that express
high levels of HER2 protein. In a recent study, nude mice bearing BT-474 xenografts were
sensitive to the anti-HER2 humanized monoclonal antibody Herceptin®, resulting in an 80%
inhibition of tumor growth at a 1 mg/kg dose (ip, 2 x week for 4-5 weeks). Tumor
eradication was observed in 3 of 8 mice treated in this manner (Baselga, J., Norton, L.,
Albanell, J., Kim, Y.M. and Mendelsohn, J. (1998) Recombinant humanized anti-HER2
antibody (Herceptin) enhances the antitumor activity of paclitaxel and doxorubicin against
HER2/neu overexpressing human breast cancer xenografts. Cancer Res. 15: 2825-2831).
This same study compared the efficacy of Herceptin® alone or in combination with the
commonly used chemotherapeutics, paclitaxel or doxorubicin. Although all three anti-HER2
agents caused modest inhibition of tumor growth, the greatest antitumor activity was
produced by the combination of Herceptin® and paclitaxel (93% inhibition of tumor growth
vs 35% with paclitaxel alone). The above studies provide proof that inhibition of HER2
expression by anti-HER2 agents causes inhibition of tumor growth in animals. Lead
anti-HER2 DNAzymes chosen from in vitro assays are further tested in mouse xenograft models.
DNAzymes are first tested alone and then in combination with standard chemotherapies.

Animal Model Development

Three human breast tumor cell lines (T47D, SKBR-3 and BT-474) were characterized
to establish their growth curves in mice. These three cell lines have been implanted into the
mammary papillae of both nude and SCID mice and primary tumor volumes are measured 3
times per week. Growth characteristics of these tumor lines using a Matrigel implantation
format can also be established. Two other breast cell lines that have been
engineered to express high levels of HER2 can also be used in the described studies. The
tumor cell line(s) and implantation method that support the most consistent and reliable
tumor growth are used in animal studies testing the lead HER2 DNAzyme(s). DNAzymes are
administered by daily subcutaneous injection or by continuous subcutaneous infusion from
Alzet mini osmotic pumps beginning 3 days after tumor implantation and continuing for the
duration of the study. Group sizes of at least 10 animals are employed. Efficacy is
determined by statistical comparison of tumor volume of DNAzyme-treated animals to a
control group of animals treated with saline alone. Because the growth of these tumors is
generally slow (45-60 days), an initial endpoint is the time in days it takes to establish an
easily measurable primary tumor (i.e. 50-100 mm3) in the presence or absence of DNAzyme
treatment.
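
The initial endpoint described above, the time in days needed to establish an easily
measurable tumor, can be estimated from serial volume measurements as in the following
sketch. The function name, the linear interpolation between measurement days, and the
example data are assumptions for illustration; the default threshold of 50 mm3 is the
lower bound of the 50-100 mm3 range given in the text.

```python
from typing import Optional, Sequence


def days_to_threshold(
    days: Sequence[float],
    volumes_mm3: Sequence[float],
    threshold_mm3: float = 50.0,
) -> Optional[float]:
    """Estimate the day on which tumor volume first reaches the threshold.

    Measurements must be in chronological order. The crossing time is
    linearly interpolated between the two bracketing measurements.
    Returns None if the threshold is never reached during the study.
    """
    if len(days) != len(volumes_mm3):
        raise ValueError("days and volumes_mm3 must have the same length")
    previous_day = previous_volume = None
    for day, volume in zip(days, volumes_mm3):
        if volume >= threshold_mm3:
            if previous_day is None:
                return float(day)  # already at threshold at first measurement
            fraction = (threshold_mm3 - previous_volume) / (volume - previous_volume)
            return previous_day + fraction * (day - previous_day)
        previous_day, previous_volume = day, volume
    return None


if __name__ == "__main__":
    # Hypothetical measurements for one animal (days after implantation, mm3).
    print(days_to_threshold([3, 5, 7], [10.0, 30.0, 70.0]))  # prints 6.0
```

Per-animal values from the DNAzyme-treated and saline-treated groups could then be
compared statistically, as the text describes for tumor volume.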



Clinical Summary

Overview 

Breast cancer is a common cancer in women and also occurs in men to a lesser degree.
The incidence of breast cancer in the United States is ~180,000 cases per year and ~46,000
patients die each year of the disease. In addition, 21,000 new cases of ovarian cancer per
year lead to ~13,000 deaths (data from Hung, M.-C., Matin, A., Zhang, Y., Xing, X., Sorgi, F.,
Huang, L. and Yu, D. (1995) HER-2/neu-targeting gene therapy - a review. Gene 159: 65-71 and
the Surveillance, Epidemiology and End Results Program, NCI Surveillance, Epidemiology and
End Results Program (SEER) Cancer Statistics Review:
http://www.seer.ims.nci.nih.gov/Publications/CSR1973_1996/). Ovarian cancer is a potential
secondary indication for anti-HER2 DNAzyme therapy.

A full review of breast cancer is given in the NCI PDQ for Breast Cancer (NCI
PDQ/Treatment/Health Professionals/Breast Cancer:
http://cancernet.nci.nih.gov/clinpdq/soa/Breast_cancer_Physician.html; NCI
PDQ/Treatment/Patients/Breast Cancer:
http://cancernet.nci.nih.gov/clinpdq/pif/Breast_cancer_Patient.html). A brief overview is
given here. Breast cancer is evaluated or "staged" on the basis of tumor size, and whether it
has spread to lymph nodes and/or other parts of the body. In Stage I breast cancer, the cancer
is no larger than 2 centimeters and has not spread outside of the breast. In Stage II, the
patient's tumor is 2-5 centimeters but cancer may have spread to the axillary lymph nodes.
By Stage III, metastasis to the lymph nodes is typical, and tumors are > 5 centimeters.
Additional tissue involvement (skin, chest wall, ribs, muscles etc.) may also be noted. Once
cancer has spread to additional organs of the body, it is classed as Stage IV.

Almost all breast cancers (>90%) are detected at Stage I or II, but 31% of these are
already lymph node positive. The 5-year survival rate for node negative patients (with
standard surgery/radiation/chemotherapy/hormone regimens) is 97%; however, involvement
of the lymph nodes reduces the 5-year survival to only 77%. Involvement of other organs
(>Stage III) drastically reduces the overall survival, to 22% at 5 years. Thus, chance of
recovery from breast cancer is highly dependent on early detection. Because up to 10% of
breast cancers are hereditary, those with a family history are considered to be at high risk for
breast cancer and should be monitored very closely.

Therapy 

Breast cancer is highly treatable and often curable when detected in the early stages.
(For a complete review of breast cancer treatments, see the NCI PDQ for Breast Cancer.)
Common therapies include surgery, radiation therapy, chemotherapy and hormonal therapy.
Depending upon many factors, including the tumor size, lymph node involvement and
location of the lesion, surgical removal varies from lumpectomy (removal of the tumor and
some surrounding tissue) to mastectomy (removal of the breast, lymph nodes and some or all
of the underlying chest muscle). Even with successful surgical resection, as many as 21% of
the patients may ultimately relapse (10-20 years). Thus, once local disease is controlled by
surgery, adjuvant radiation treatments, chemotherapies and/or hormonal therapies are
typically used to reduce the rate of recurrence and improve survival. The therapy regimen
employed depends not only on the stage of the cancer at its time of removal, but also on other
variables such as the type of cancer (ductal or lobular), whether lymph nodes were involved
and removed, the age and general health of the patient, and whether other organs are involved.

Common chemotherapies include various combinations of cytotoxic drugs to kill the
cancer cells. These drugs include paclitaxel (Taxol), docetaxel, cisplatin, methotrexate,
cyclophosphamide, doxorubicin, fluorouracil etc. Significant toxicities are associated with
these cytotoxic therapies. Well-characterized toxicities include nausea and vomiting,
myelosuppression, alopecia and mucositis. Serious cardiac problems are also associated with
certain of the combinations, e.g. doxorubicin and paclitaxel, but are less common.

Testing for estrogen and progesterone receptors helps to determine whether certain
anti-hormone therapies might be helpful in inhibiting tumor growth. If either or both receptors
are present, therapies to interfere with the action of the hormone ligands can be given in
combination with chemotherapy and are generally continued for several years. These
adjuvant therapies are called SERMs, selective estrogen receptor modulators, and they can
give beneficial estrogen-like effects on bone and lipid metabolism while antagonizing
estrogen in reproductive tissues. Tamoxifen is one such compound. The primary toxic effect
associated with the use of tamoxifen is a 2 to 7-fold increase in the rate of endometrial cancer.
Blood clots in the legs and lung and the possibility of stroke are additional side effects.
However, tamoxifen has been determined to reduce breast cancer incidence by 49% in
high-risk patients and an extensive, somewhat controversial, clinical study is underway to
expand the prophylactic use of tamoxifen. Another SERM, raloxifene, was also shown to reduce
the incidence of breast cancer in a large clinical trial where it was being used to treat
osteoporosis. In additional studies, removal of the ovaries and/or drugs to keep the ovaries
from working are being tested.

Bone marrow transplantation is being studied in clinical trials for breast cancers that
have become resistant to traditional chemotherapies or where >3 lymph nodes are involved.
Marrow is removed from the patient prior to high-dose chemotherapy to protect it from being
destroyed, and then replaced after the chemotherapy. Another type of "transplant" involves
the exogenous treatment of peripheral blood stem cells with drugs to kill cancer cells prior to
replacing the treated cells in the bloodstream.

One biological treatment, a humanized monoclonal anti-HER2 antibody, Herceptin®
(Genentech), has been approved by the FDA as an additional treatment for HER2 positive
tumors. Herceptin® binds with high affinity to the extracellular domain of HER2 and thus
blocks its signaling action. Herceptin® can be used alone or in combination with
chemotherapeutics (e.g. paclitaxel, docetaxel, cisplatin, etc.) (Pegram, M.D., Lipton, A.,
Hayes, D.F., Weber, B.L., Baselga, J.M., Tripathy, D., Baly, D., Baughman, S.A., Twaddell,
T., Glaspy, J.A. and Slamon, D.J. (1998) Phase II study of receptor-enhanced
chemosensitivity using recombinant humanized anti-p185HER2/neu monoclonal antibody
plus cisplatin in patients with HER2/neu-overexpressing metastatic breast cancer refractory to
chemotherapy treatment. J. Clin. Oncol. 16: 2659-2671). In Phase III studies, Herceptin®
significantly improved the response rate to chemotherapy as well as improving the time to
progression (Ross, J.S. and Fletcher, J.A. (1998) The HER-2/neu oncogene in breast cancer:
Prognostic factor, predictive factor and target for therapy. Oncologist 3: 1998). The most
common side effects attributed to Herceptin® are fever and chills, pain, asthenia, nausea,
vomiting, increased cough, diarrhea, headache, dyspnea, infection, rhinitis, and insomnia.
Herceptin® in combination with chemotherapy (paclitaxel) can lead to cardiotoxicity
(Sparano, J.A. (1999) Doxorubicin/taxane combinations: Cardiac toxicity and
pharmacokinetics. Semin. Oncol. 26: 14-19), leukopenia, anemia, diarrhea, abdominal pain
and infection.

HER2 Protein Levels for Patient Screening and as a Potential Endpoint 

Because elevated HER2 levels can be detected in at least 30% of breast cancers, breast 
cancer patients can be pre-screened for elevated HER2 prior to admission to initial clinical 
trials testing an anti-HER2 DNAzyme. Initial HER2 levels can be determined (by ELISA) 
from tumor biopsies or resected tumor samples. 

During clinical trials, it may be possible to monitor circulating HER2 protein by ELISA 
(Ross and Fletcher, 1998). Evaluation of serial blood/serum samples over the course of the 
anti-HER2 DNAzyme treatment period could be useful in determining early indications of 
efficacy. In fact, the clinical course of Stage IV breast cancer was correlated with shed HER2 
protein fragment following a dose-intensified paclitaxel monotherapy. In all responders, the 
HER2 serum level decreased below the detection limit (Luftner, D., Schnabel, S. and
Possinger, K. (1999) c-erbB-2 in serum of patients receiving fractionated paclitaxel
chemotherapy. Int. J. Biol. Markers 14: 55-59).

Two cancer-associated antigens, CA27.29 and CA15.3, can also be measured in the 
serum. Both of these glycoproteins have been used as diagnostic markers for breast cancer. 
CA27.29 levels are higher than CA15.3 in breast cancer patients; the reverse is true in healthy 
individuals. Of these two markers, CA27.29 was found to better discriminate primary cancer 
from healthy subjects. In addition, a statistically significant and direct relationship was 
shown between CA27.29 and large vs small tumors and node positive vs node negative disease
(Gion, M., Mione, R., Leon, A.E. and Dittadi, R. (1999) Comparison of the diagnostic 
accuracy of CA27.29 and CA15.3 in primary breast cancer. Clin. Chem. 45: 630-637). 
Moreover, both cancer antigens were found to be suitable for the detection of possible 
metastases during follow-up (Rodriguez de Paterna, L., Arnaiz, F., Estenoz, J., Ortuno, B. and
Lanzos E. (1999) Study of serum tumor markers CEA, CA15.3, CA27.29 as diagnostic
parameters in patients with breast carcinoma. Int. J. Biol. Markers 10: 24-29). Thus,
blocking breast tumor growth may be reflected in lower CA27.29 and/or CA15.3 levels
compared to a control group. FDA submissions for the use of CA27.29 and CA15.3 for
monitoring metastatic breast cancer patients have been filed (reviewed in Beveridge, R.A.
(1999) Review of clinical studies of CA27.29 in breast cancer management. Int. J. Biol.
Markers 14: 36-39). Fully automated methods for measurement of either of these markers
are commercially available. 

Indications 

Particular degenerative and disease states that can be associated with HER2 expression
modulation include but are not limited to cancer, for example breast cancer and ovarian
cancer, and/or any other diseases or conditions that are related to or will respond to the levels
of HER2 in a cell or tissue, alone or in combination with other therapies.

The present body of knowledge in HER2 research indicates the need for methods to 
assay HER2 activity and for compounds that can regulate HER2 expression for research, 
diagnostic, and therapeutic use. 

The use of monoclonal antibodies, chemotherapy, radiation therapy, and analgesics are
all non-limiting examples of methods that can be combined with or used in conjunction with
the nucleic acid molecules (e.g. DNAzymes) of the instant invention. Common
chemotherapies that can be combined with nucleic acid molecules of the instant invention
include various combinations of cytotoxic drugs to kill cancer cells. These drugs include but
are not limited to paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, cyclophosphamide,
doxorubicin, fluorouracil, carboplatin, edatrexate, gemcitabine, vinorelbine, etc. Those skilled
in the art will recognize that other drug compounds and therapies can similarly be readily
combined with the nucleic acid molecules of the instant invention (e.g. DNAzyme molecules)
and are hence within the scope of the instant invention.

Diagnostic uses

The nucleic acid molecules of this invention (e.g., enzymatic nucleic acid molecules) 
can be used as diagnostic tools to examine genetic drift and mutations within diseased cells or 
to detect the presence of HER2 RNA in a cell. The close relationship between enzymatic 
nucleic acid molecule activity and the structure of the target RNA allows the detection of 
mutations in any region of the molecule that alter the base-pairing and three-dimensional 
structure of the target RNA. By using multiple enzymatic nucleic acid molecules described in 
this invention, one can map nucleotide changes which are important to RNA structure and 
function in vitro, as well as in cells and tissues. Cleavage of target RNAs with enzymatic 
nucleic acid molecules can be used to inhibit gene expression and define the role (essentially) 
of specified gene products in the progression of disease. In this manner, other genetic targets 
can be defined as important mediators of the disease. These experiments can lead to better 
treatment of the disease progression by affording the possibility of combinational therapies 
(e.g., multiple enzymatic nucleic acid molecules targeted to different genes, enzymatic 
nucleic acid molecules coupled with known small molecule inhibitors, or intermittent 
treatment with combinations of enzymatic nucleic acid molecules and/or other chemical or 
biological molecules). Other in vitro uses of enzymatic nucleic acid molecules of this 
invention are well known in the art, and include detection of the presence of mRNAs 
associated with an HER2-related condition. Such RNA is detected by determining the presence 
of a cleavage product after treatment with an enzymatic nucleic acid molecule using standard 
methodology. 
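
The dependence of cleavage on target sequence described above can be sketched in a few lines of Python. The sketch is illustrative only. It assumes a DNAzyme written in the three-part layout used in Table II (arm, catalytic core, arm), perfect Watson-Crick pairing of both arms, and cleavage immediately 3' of a single unpaired purine. The function names and the example mutant sequence are hypothetical and are not part of the specification.

```python
# Illustrative sketch only; names and the simplified pairing model are assumptions.
PAIR = {"A": "U", "C": "G", "G": "C", "T": "A"}  # DNA base -> pairing RNA base


def paired_rna(dna_arm):
    """Return the RNA sequence (5'->3') that base-pairs with a DNA arm (5'->3')."""
    return "".join(PAIR[base] for base in reversed(dna_arm))


def cleavage_fragments(target_rna, dnazyme):
    """Return (5' fragment length, 3' fragment length), or None if no site pairs.

    `dnazyme` uses the Table II layout: "arm core arm", separated by spaces.
    The first arm pairs 3' of the cleavage site and the last arm pairs 5' of it.
    """
    arm_3p_side, _core, arm_5p_side = dnazyme.split()
    upstream = paired_rna(arm_5p_side)
    downstream = paired_rna(arm_3p_side)
    for purine in "AG":  # assumed: one unpaired purine at the cleavage site
        start = target_rna.find(upstream + purine + downstream)
        if start != -1:
            cut = start + len(upstream) + 1  # assumed cut just 3' of the purine
            return cut, len(target_rna) - cut
    return None


dnazyme = "CGCGGCCG GGCTAGCTACAACGA CGCCTAGG"  # first DNAzyme listed in Table II
wild_type = "CCUAGGCGGCGGCCGCG"                 # its Table II substrate, spaces removed
mutant = "CCUAGGCGGCGACCGCG"                    # hypothetical single-base change in one arm

print(cleavage_fragments(wild_type, dnazyme))  # fragment lengths if the arms pair
print(cleavage_fragments(mutant, dnazyme))     # None when pairing is disrupted
```

In this simplified model a single mismatch in either binding arm abolishes cleavage, which is the property the diagnostic use relies on. Real molecules tolerate or reject mismatches to varying degrees, so the all-or-nothing behavior here is a modeling shortcut.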

In a specific example, enzymatic nucleic acid molecules that cleave only wild-type or 
mutant forms of the target RNA are used for the assay. The first enzymatic nucleic acid 
molecule is used to identify wild-type RNA present in the sample and the second enzymatic 
nucleic acid molecule is used to identify mutant RNA in the sample. As reaction controls, 
synthetic substrates of both wild-type and mutant RNA are cleaved by both enzymatic nucleic 
acid molecules to demonstrate the relative enzymatic nucleic acid molecule efficiencies in the 
reactions and the absence of cleavage of the "non-targeted" RNA species. The cleavage 
products from the synthetic substrates also serve to generate size markers for the analysis of 
wild-type and mutant RNAs in the sample population. Thus each analysis requires two 
enzymatic nucleic acid molecules, two substrates and one unknown sample, which are 
combined into six reactions. The presence of cleavage products is determined using an 
RNase protection assay so that full-length and cleavage fragments of each RNA can be 
analyzed in one lane of a polyacrylamide gel. It is not absolutely required to quantify the 
results to gain insight into the expression of mutant RNAs and putative risk of the desired 
phenotypic changes in target cells. The expression of mRNA whose protein product is 
implicated in the development of the phenotype (i.e., HER2) is adequate to establish risk. If 
probes of comparable specific activity are used for both transcripts, then a qualitative 
comparison of RNA levels will be adequate and will decrease the cost of the initial diagnosis. 
Higher mutant form to wild-type ratios are correlated with higher risk whether RNA levels 
are compared qualitatively or quantitatively. The use of enzymatic nucleic acid molecules in 
diagnostic applications contemplated by the instant invention is more fully described in 
George et al., US Patent Nos. 5,834,186 and 5,741,679, Shih et al., US Patent No. 5,589,332, 
Nathan et al., US Patent No. 5,871,914, Nathan and Ellington, International PCT Publication 
No. WO 00/24931, Breaker et al., International PCT Publication Nos. WO 00/26226 and 
98/27104, and Sullenger et al., International PCT Publication No. WO 99/29842. 
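
The assay layout above (two enzymatic nucleic acid molecules, each run against two synthetic substrates and the unknown sample) can be enumerated as follows. This is a minimal sketch: the labels, the function names, and the idea of passing band intensities as plain numbers are assumptions made for illustration, and no risk threshold is implied.

```python
from itertools import product

# Hypothetical labels for the two enzymes and the three RNA inputs of one analysis.
ENZYMES = ("wild-type-specific", "mutant-specific")
RNA_INPUTS = (
    "synthetic wild-type substrate",
    "synthetic mutant substrate",
    "unknown sample",
)


def reaction_plan():
    """List every enzyme/RNA pairing: 2 enzymes x 3 RNA inputs = 6 reactions."""
    return list(product(ENZYMES, RNA_INPUTS))


def mutant_to_wild_type_ratio(mutant_signal, wild_type_signal):
    """Ratio of mutant to wild-type cleavage signal measured in the unknown sample."""
    if wild_type_signal <= 0:
        raise ValueError("wild-type signal must be positive")
    return mutant_signal / wild_type_signal


for number, (enzyme, rna) in enumerate(reaction_plan(), start=1):
    print(f"reaction {number}: {enzyme} enzyme + {rna}")
```

The ratio helper only expresses the comparison named in the text. Whether the inputs come from a qualitative reading or a quantified gel is left open, as it is in the description.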



Additional Uses 

Sequence-specific enzymatic nucleic acid molecules of the instant invention can have 
many of the same applications for the study of RNA that DNA restriction 
endonucleases have for the study of DNA (Nathans et al., 1975 Ann. Rev. Biochem. 44:273). 
For example, the pattern of restriction fragments can be used to establish sequence 
relationships between two related RNAs, and large RNAs can be specifically cleaved to 
fragments of a size more useful for study. The ability to engineer sequence specificity of the 
enzymatic nucleic acid molecule is ideal for cleavage of RNAs of unknown sequence. 
Applicant has described the use of nucleic acid molecules to modulate gene expression of 
target genes in bacterial, microbial, fungal, viral, and eukaryotic systems including plant or 
mammalian cells. 
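
As one illustration of engineered sequence specificity, the DNAzyme entries of Table II appear to follow a simple rule: the reverse complement of the substrate's 3' arm, then the core GGCTAGCTACAACGA, then the reverse complement of the substrate's 5' arm. The sketch below applies that rule. The rule is inferred from the table entries rather than stated in the text, the function names are hypothetical, and chemical modifications are ignored.

```python
# Illustrative sketch; the design rule is inferred from the Table II entries.
CORE = "GGCTAGCTACAACGA"  # core sequence shared by every Table II DNAzyme
PAIR = {"A": "T", "C": "G", "G": "C", "U": "A"}  # RNA base -> pairing DNA base


def reverse_complement_as_dna(rna_arm):
    """Return the DNA strand (5'->3') that base-pairs with an RNA arm (5'->3')."""
    return "".join(PAIR[base] for base in reversed(rna_arm))


def design_dnazyme(substrate):
    """Build a DNAzyme string from a substrate in Table II layout.

    `substrate` is "5'-arm site 3'-arm" separated by spaces,
    for example "CCUAGGCG G CGGCCGCG".
    """
    five_prime_arm, site, three_prime_arm = substrate.split()
    if site not in ("A", "G"):
        raise ValueError("Table II substrates show a single A or G at the site")
    return " ".join(
        [
            reverse_complement_as_dna(three_prime_arm),
            CORE,
            reverse_complement_as_dna(five_prime_arm),
        ]
    )


print(design_dnazyme("CCUAGGCG G CGGCCGCG"))
```

The same function can be used as a consistency check on table entries whose substrate and DNAzyme columns should agree.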

All patents and publications mentioned in the specification are indicative of the levels of 
skill of those skilled in the art to which the invention pertains. All references cited in this 
disclosure are incorporated by reference to the same extent as if each reference had been 
incorporated by reference in its entirety individually. 

One skilled in the art would readily appreciate that the present invention is well adapted 
to carry out the objects and obtain the ends and advantages mentioned, as well as those 
inherent therein. The methods and compositions described herein as presently representative 
of preferred embodiments are exemplary and are not intended as limitations on the scope of 
the invention. Changes therein and other uses will occur to those skilled in the art, which are 
encompassed within the spirit of the invention and are defined by the scope of the claims. 

It will be readily apparent to one skilled in the art that varying substitutions and 
modifications can be made to the invention disclosed herein without departing from the scope 
and spirit of the invention. Thus, such additional embodiments are within the scope of the 
present invention and the following claims. 

The invention illustratively described herein suitably can be practiced in the absence of 
any element or elements, limitation or limitations which is not specifically disclosed herein. 
Thus, for example, in each instance herein any of the terms "comprising", "consisting 
essentially of" and "consisting of" can be replaced with either of the other two terms. The 
terms and expressions that have been employed are used as terms of description and not of 
limitation, and there is no intention in the use of such terms and expressions of excluding 
any equivalents of the features shown and described or portions thereof, but it is recognized 
that various modifications are possible within the scope of the invention claimed. Thus, it 
should be understood that although the present invention has been specifically disclosed by 
preferred embodiments, optional features, modification and variation of the concepts herein 
disclosed can be resorted to by those skilled in the art, and that such modifications and 
variations are considered to be within the scope of this invention as defined by the description 
and the appended claims. 

In addition, where features or aspects of the invention are described in terms of 
Markush groups or other grouping of alternatives, those skilled in the art will recognize that 
the invention is also thereby described in terms of any individual member or subgroup of 
members of the Markush group or other group. 

Other embodiments are within the claims that follow. 

Table I: 

A. 2.5 µmol Synthesis Cycle ABI 394 Instrument 

Reagent             Equivalents  Amount    Wait Time* DNA  Wait Time* 2'-O-methyl  Wait Time* RNA
Phosphoramidites    6.5          163 µL    45 sec          2.5 min                 7.5 min
S-Ethyl Tetrazole   23.8         238 µL    45 sec          2.5 min                 7.5 min
Acetic Anhydride    100          233 µL    5 sec           5 sec                   5 sec
N-Methyl Imidazole  186          233 µL    5 sec           5 sec                   5 sec
TCA                 176          2.3 mL    21 sec          21 sec                  21 sec
Iodine              11.2         1.7 mL    45 sec          45 sec                  45 sec
Beaucage            12.9         645 µL    100 sec         300 sec                 300 sec
Acetonitrile        NA           6.67 mL   NA              NA                      NA

B. 0.2 µmol Synthesis Cycle ABI 394 Instrument 

Reagent             Equivalents  Amount    Wait Time* DNA  Wait Time* 2'-O-methyl  Wait Time* RNA
Phosphoramidites    15           31 µL     45 sec          233 sec                 465 sec
S-Ethyl Tetrazole   38.7         31 µL     45 sec          233 min                 465 sec
Acetic Anhydride    655          124 µL    5 sec           5 sec                   5 sec
N-Methyl Imidazole  1245         124 µL    5 sec           5 sec                   5 sec
TCA                 700          732 µL    10 sec          10 sec                  10 sec
Iodine              20.6         244 µL    15 sec          15 sec                  15 sec
Beaucage            7.7          232 µL    100 sec         300 sec                 300 sec
Acetonitrile        NA           2.64 mL   NA              NA                      NA

C. 0.2 µmol Synthesis Cycle 

Reagent             Equivalents:          Amount:               Wait Time* DNA  Wait Time* 2'-O-methyl  Wait Time* Ribo
                    DNA/2'-O-methyl/Ribo  DNA/2'-O-methyl/Ribo
Phosphoramidites    22/33/66              40/60/120 µL          60 sec          180 sec                 360 sec
S-Ethyl Tetrazole   70/105/210            40/60/120 µL          60 sec          180 min                 360 sec
Acetic Anhydride    265/265/265           50/50/50 µL           10 sec          10 sec                  10 sec
N-Methyl Imidazole  502/502/502           50/50/50 µL           10 sec          10 sec                  10 sec
TCA                 238/475/475           250/500/500 µL        15 sec          15 sec                  15 sec
Iodine              6.8/6.8/6.8           80/80/80 µL           30 sec          30 sec                  30 sec
Beaucage            34/51/51              80/120/120            100 sec         200 sec                 200 sec
Acetonitrile        NA                    1150/1150/1150 µL     NA              NA                      NA

* Wait time does not include contact time during delivery. 

Table II: Human K-Ras DNAzyme and Substrate Sequence 

Pos   Substrate            Seq ID  DNAzyme                            Seq ID
10    CCUAGGCG G CGGCCGCG  1       CGCGGCCG GGCTAGCTACAACGA CGCCTAGG  2329
13    AGGCGGCG G CCGCGGCG  2       CGCCGCGG GGCTAGCTACAACGA CGCCGCCT  2330
16    CGGCGGCC G CGGCGGCG  3       CGCCGCCG GGCTAGCTACAACGA GGCCGCCG  2331
19    CGGCCGCG G CGGCGGAG  4       CTCCGCCG GGCTAGCTACAACGA CGCGGCCG  2332
22    CCGCGGCG G CGGAGGCA  5       TGCCTCCG GGCTAGCTACAACGA CGCCGCGG  2333
28    CGGCGGAG G CAGCAGCG  6       CGCTGCTG GGCTAGCTACAACGA CTCCGCCG  2334
31    CGGAGGCA G CAGCGGCG  7       CGCCGCTG GGCTAGCTACAACGA TGCCTCCG  2335
34    AGGCAGCA G CGGCGGCG  8       CGCCGCCG GGCTAGCTACAACGA TGCTGCCT  2336
37    CAGCAGCG G CGGCGGCA  9       TGCCGCCG GGCTAGCTACAACGA CGCTGCTG  2337
40    CAGCGGCG G CGGCAGUG  10      CACTGCCG GGCTAGCTACAACGA CGCCGCTG  2338
43    CGGCGGCG G CAGUGGCG  11      CGCCACTG GGCTAGCTACAACGA CGCCGCCG  2339
46    CGGCGGCA G UGGCGGCG  12      CGCCGCCA GGCTAGCTACAACGA TGCCGCCG  2340
49    CGGCAGUG G CGGCGGCG  13      CGCCGCCG GGCTAGCTACAACGA CACTGCCG  2341
52    CAGUGGCG G CGGCGAAG  14      CTTCGCCG GGCTAGCTACAACGA CGCCACTG  2342
55    UGGCGGCG G CGAAGGUG  15      CACCTTCG GGCTAGCTACAACGA CGCCGCCA  2343
61    CGGCGAAG G UGGCGGCG  16      CGCCGCCA GGCTAGCTACAACGA CTTCGCCG  2344
64    CGAAGGUG G CGGCGGCU  17      AGCCGCCG GGCTAGCTACAACGA CACCTTCG  2345
67    AGGUGGCG G CGGCUCGG  18      CCGAGCCG GGCTAGCTACAACGA CGCCACCT  2346
70    UGGCGGCG G CUCGGCCA  19      TGGCCGAG GGCTAGCTACAACGA CGCCGCCA  2347
75    GCGGCUCG G CCAGUACU  20      AGTACTGG GGCTAGCTACAACGA CGAGCCGC  2348
79    CUCGGCCA G UACUCCCG  21      CGGGAGTA GGCTAGCTACAACGA TGGCCGAG  2349
81    CGGCCAGU A CUCCCGGC  22      GCCGGGAG GGCTAGCTACAACGA ACTGGCCG  2350
88    UACUCCCG G CCCCCGCC  23      GGCGGGGG GGCTAGCTACAACGA CGGGAGTA  2351
94    CGGCCCCC G CCAUUUCG  24      CGAAATGG GGCTAGCTACAACGA GGGGGCCG  2352
97    CCCCCGCC A UUUCGGAC  25      GTCCGAAA GGCTAGCTACAACGA GGCGGGGG  2353
104   CAUUUCGG A CUGGGAGC  26      GCTCCCAG GGCTAGCTACAACGA CCGAAATG  2354
111   GACUGGGA G CGAGCGCG  27      CGCGCTCG GGCTAGCTACAACGA TCCCAGTC  2355
115   GGGAGCGA G CGCGGCGC  28      GCGCCGCG GGCTAGCTACAACGA TCGCTCCC  2356
117   GAGCGAGC G CGGCGCAG  29      CTGCGCCG GGCTAGCTACAACGA GCTCGCTC  2357
120   CGAGCGCG G CGCAGGCA  30      TGCCTGCG GGCTAGCTACAACGA CGCGCTCG  2358
122   AGCGCGGC G CAGGCACU  31      AGTGCCTG GGCTAGCTACAACGA GCCGCGCT  2359
126   CGGCGCAG G CACUGAAG  32      CTTCAGTG GGCTAGCTACAACGA CTGCGCCG  2360
128   GCGCAGGC A CUGAAGGC  33      GCCTTCAG GGCTAGCTACAACGA GCCTGCGC  2361
135   CACUGAAG G CGGCGGCG  34      CGCCGCCG GGCTAGCTACAACGA CTTCAGTG  2362
138   UGAAGGCG G CGGCGGGG  35      CCCCGCCG GGCTAGCTACAACGA CGCCTTCA  2363
141   AGGCGGCG G CGGGGCCA  36      TGGCCCCG GGCTAGCTACAACGA CGCCGCCT  2364
146   GCGGCGGG G CCAGAGGC  37      GCCTCTGG GGCTAGCTACAACGA CCCGCCGC  2365
153   GGCCAGAG G CUCAGCGG  38      CCGCTGAG GGCTAGCTACAACGA CTCTGGCC  2366
158   GAGGCUCA G CGGCUCCC  39      GGGAGCCG GGCTAGCTACAACGA TGAGCCTC  2367
161   GCUCAGCG G CUCCCAGG  40      CCTGGGAG GGCTAGCTACAACGA CGCTGAGC  2368
169   GCUCCCAG G UGCGGGAG  41      CTCCCGCA GGCTAGCTACAACGA CTGGGAGC  2369
171   UCCCAGGU G CGGGAGAG  42      CTCTCCCG GGCTAGCTACAACGA ACCTGGGA  2370
182   GGAGAGAG G CCUGCUGA  43      TCAGCAGG GGCTAGCTACAACGA CTCTCTCC  2371
186   AGAGGCCU G CUGAAAAU  44      ATTTTCAG GGCTAGCTACAACGA AGGCCTCT  2372
193   UGCUGAAA A UGACUGAA  45      TTCAGTCA GGCTAGCTACAACGA TTTCAGCA  2373
196   UGAAAAUG A CUGAAUAU  46      ATATTCAG GGCTAGCTACAACGA CATTTTCA  2374
201   AUGACUGA A UAUAAACU  47      AGTTTATA GGCTAGCTACAACGA TCAGTCAT  2375
203   GACUGAAU A UAAACUUG  48      CAAGTTTA GGCTAGCTACAACGA ATTCAGTC  2376
207   GAAUAUAA A CUUGUGGU  49      ACCACAAG GGCTAGCTACAACGA TTATATTC  2377
211   AUAAACUU G UGGUAGUU  50      AACTACCA GGCTAGCTACAACGA AAGTTTAT  2378
214   AACUUGUG G UAGUUGGA  51      TCCAACTA GGCTAGCTACAACGA CACAAGTT  2379
217   UUGUGGUA G UUGGAGCU  52      AGCTCCAA GGCTAGCTACAACGA TACCACAA  2380
223   UAGUUGGA G CUUGUGGC  53      GCCACAAG GGCTAGCTACAACGA TCCAACTA  2381
227   UGGAGCUU G UGGCGUAG  54      CTACGCCA GGCTAGCTACAACGA AAGCTCCA  2382
230   AGCUUGUG G CGUAGGCA  55      TGCCTACG GGCTAGCTACAACGA CACAAGCT  2383
232   CUUGUGGC G UAGGCAAG  56      CTTGCCTA GGCTAGCTACAACGA GCCACAAG  2384
236   UGGCGUAG G CAAGAGUG  57      CACTCTTG GGCTAGCTACAACGA CTACGCCA  2385
242   AGGCAAGA G UGCCUUGA  58      TCAAGGCA GGCTAGCTACAACGA TCTTGCCT  2386
244   GCAAGAGU G CCUUGACG  59      CGTCAAGG GGCTAGCTACAACGA ACTCTTGC  2387
250   GUGCCUUG A CGAUACAG  60      CTGTATCG GGCTAGCTACAACGA CAAGGCAC  2388
253   CCUUGACG A UACAGCUA  61      TAGCTGTA GGCTAGCTACAACGA CGTCAAGG  2389
255   UUGACGAU A CAGCUAAU  62      ATTAGCTG GGCTAGCTACAACGA ATCGTCAA  2390
258   ACGAUACA G CUAAUUCA  63      TGAATTAG GGCTAGCTACAACGA TGTATCGT  2391
262   UACAGCUA A UUCAGAAU  64      ATTCTGAA GGCTAGCTACAACGA TAGCTGTA  2392
269   AAUUCAGA A UCAUUUUG  65      CAAAATGA GGCTAGCTACAACGA TCTGAATT  2393
272   UCAGAAUC A UUUUGUGG  66      CCACAAAA GGCTAGCTACAACGA GATTCTGA  2394
277   AUCAUUUU G UGGACGAA  67      TTCGTCCA GGCTAGCTACAACGA AAAATGAT  2395
281   UUUUGUGG A CGAAUAUG  68      CATATTCG GGCTAGCTACAACGA CCACAAAA  2396
285   GUGGACGA A UAUGAUCC  69      GGATCATA GGCTAGCTACAACGA TCGTCCAC  2397
287   GGACGAAU A UGAUCCAA  70      TTGGATCA GGCTAGCTACAACGA ATTCGTCC  2398
290   CGAAUAUG A UCCAACAA  71      TTGTTGGA GGCTAGCTACAACGA CATATTCG  2399
295   AUGAUCCA A CAAUAGAG  72      CTCTATTG GGCTAGCTACAACGA TGGATCAT  2400
298   AUCCAACA A UAGAGGAU  73      ATCCTCTA GGCTAGCTACAACGA TGTTGGAT  2401
305   AAUAGAGG A UUCCUACA  74      TGTAGGAA GGCTAGCTACAACGA CCTCTATT  2402
311   GGAUUCCU A CAGGAAGC  75      GCTTCCTG GGCTAGCTACAACGA AGGAATCC  2403
318   UACAGGAA G CAAGUAGU  76      ACTACTTG GGCTAGCTACAACGA TTCCTGTA  2404
322   GGAAGCAA G UAGUAAUU  77      AATTACTA GGCTAGCTACAACGA TTGCTTCC  2405
325   AGCAAGUA G UAAUUGAU  78      ATCAATTA GGCTAGCTACAACGA TACTTGCT  2406
328   AAGUAGUA A UUGAUGGA  79      TCCATCAA GGCTAGCTACAACGA TACTACTT  2407
332   AGUAAUUG A UGGAGAAA  80      TTTCTCCA GGCTAGCTACAACGA CAATTACT  2408
340   AUGGAGAA A CCUGUCUC  81      GAGACAGG GGCTAGCTACAACGA TTCTCCAT  2409
344   AGAAACCU G UCUCUUGG  82      CCAAGAGA GGCTAGCTACAACGA AGGTTTCT  2410
353   UCUCUUGG A UAUUCUCG  83      CGAGAATA GGCTAGCTACAACGA CCAAGAGA  2411
355   UCUUGGAU A UUCUCGAC  84      GTCGAGAA GGCTAGCTACAACGA ATCCAAGA  2412
362   UAUUCUCG A CACAGCAG  85      CTGCTGTG GGCTAGCTACAACGA CGAGAATA  2413
364   UUCUCGAC A CAGCAGGU  86      ACCTGCTG GGCTAGCTACAACGA GTCGAGAA  2414
367   UCGACACA G CAGGUCAA  87      TTGACCTG GGCTAGCTACAACGA TGTGTCGA  2415
371   CACAGCAG G UCAAGAGG  88      CCTCTTGA GGCTAGCTACAACGA CTGCTGTG  2416
381   CAAGAGGA G UACAGUGC  89      GCACTGTA GGCTAGCTACAACGA TCCTCTTG  2417
383   AGAGGAGU A CAGUGCAA  90      TTGCACTG GGCTAGCTACAACGA ACTCCTCT  2418
386   GGAGUACA G UGCAAUGA  91      TCATTGCA GGCTAGCTACAACGA TGTACTCC  2419
388   AGUACAGU G CAAUGAGG  92      CCTCATTG GGCTAGCTACAACGA ACTGTACT  2420
391   ACAGUGCA A UGAGGGAC  93      GTCCCTCA GGCTAGCTACAACGA TGCACTGT  2421
398   AAUGAGGG A CCAGUACA  94      TGTACTGG GGCTAGCTACAACGA CCCTCATT  2422
402   AGGGACCA G UACAUGAG  95      CTCATGTA GGCTAGCTACAACGA TGGTCCCT  2423
404   GGACCAGU A CAUGAGGA  96      TCCTCATG GGCTAGCTACAACGA ACTGGTCC  2424
406   ACCAGUAC A UGAGGACU  97      AGTCCTCA GGCTAGCTACAACGA GTACTGGT  2425
412   ACAUGAGG A CUGGGGAG  98      CTCCCCAG GGCTAGCTACAACGA CCTCATGT  2426
422   UGGGGAGG G CUUUCUUU  99      AAAGAAAG GGCTAGCTACAACGA CCTCCCCA  2427
431   CUUUCUUU G UGUAUUUG  100     CAAATACA GGCTAGCTACAACGA AAAGAAAG  2428
433   UUCUUUGU G UAUUUGCC  101     GGCAAATA GGCTAGCTACAACGA ACAAAGAA  2429
435   CUUUGUGU A UUUGCCAU  102     ATGGCAAA GGCTAGCTACAACGA ACACAAAG  2430
439   GUGUAUUU G CCAUAAAU  103     ATTTATGG GGCTAGCTACAACGA AAATACAC  2431
442   UAUUUGCC A UAAAUAAU  104     ATTATTTA GGCTAGCTACAACGA GGCAAATA  2432
446   UGCCAUAA A UAAUACUA  105     TAGTATTA GGCTAGCTACAACGA TTATGGCA  2433
449   CAUAAAUA A UACUAAAU  106     ATTTAGTA GGCTAGCTACAACGA TATTTATG  2434
451   UAAAUAAU A CUAAAUCA  107     TGATTTAG GGCTAGCTACAACGA ATTATTTA  2435
456   AAUACUAA A UCAUUUGA  108     TCAAATGA GGCTAGCTACAACGA TTAGTATT  2436
459   ACUAAAUC A UUUGAAGA  109     TCTTCAAA GGCTAGCTACAACGA GATTTAGT  2437
467   AUUUGAAG A UAUUCACC  110     GGTGAATA GGCTAGCTACAACGA CTTCAAAT  2438
469   UUGAAGAU A UUCACCAU  111     ATGGTGAA GGCTAGCTACAACGA ATCTTCAA  2439
473   AGAUAUUC A CCAUUAUA  112     TATAATGG GGCTAGCTACAACGA GAATATCT  2440
476   UAUUCACC A UUAUAGAG  113     CTCTATAA GGCTAGCTACAACGA GGTGAATA  2441
479   UCACCAUU A UAGAGAAC  114     GTTCTCTA GGCTAGCTACAACGA AATGGTGA  2442
486   UAUAGAGA A CAAAUUAA  115     TTAATTTG GGCTAGCTACAACGA TCTCTATA  2443
490   GAGAACAA A UUAAAAGA  116     TCTTTTAA GGCTAGCTACAACGA TTGTTCTC  2444
499   UUAAAAGA G UUAAGGAC  117     GTCCTTAA GGCTAGCTACAACGA TCTTTTAA  2445
506   AGUUAAGG A CUCUGAAG  118     CTTCAGAG GGCTAGCTACAACGA CCTTAACT  2446
515   CUCUGAAG A UGUACCUA  119     TAGGTACA GGCTAGCTACAACGA CTTCAGAG  2447
517   CUGAAGAU G UACCUAUG  120     CATAGGTA GGCTAGCTACAACGA ATCTTCAG  2448
519   GAAGAUGU A CCUAUGGU  121     ACCATAGG GGCTAGCTACAACGA ACATCTTC  2449
523   AUGUACCU A UGGUCCUA  122     TAGGACCA GGCTAGCTACAACGA AGGTACAT  2450
526   UACCUAUG G UCCUAGUA  123     TACTAGGA GGCTAGCTACAACGA CATAGGTA  2451
532   UGGUCCUA G UAGGAAAU  124     ATTTCCTA GGCTAGCTACAACGA TAGGACCA  2452
539   AGUAGGAA A UAAAUGUG  125     CACATTTA GGCTAGCTACAACGA TTCCTACT  2453
543   GGAAAUAA A UGUGAUUU  126     AAATCACA GGCTAGCTACAACGA TTATTTCC  2454
545   AAAUAAAU G UGAUUUGC  127     GCAAATCA GGCTAGCTACAACGA ATTTATTT  2455
548   UAAAUGUG A UUUGCCUU  128     AAGGCAAA GGCTAGCTACAACGA CACATTTA  2456
552   UGUGAUUU G CCUUCUAG  129     CTAGAAGG GGCTAGCTACAACGA AAATCACA  2457
562   CUUCUAGA A CAGUAGAC  130     GTCTACTG GGCTAGCTACAACGA TCTAGAAG  2458
565   CUAGAACA G UAGACACA  131     TGTGTCTA GGCTAGCTACAACGA TGTTCTAG  2459
569   AACAGUAG A CACAAAAC  132     GTTTTGTG GGCTAGCTACAACGA CTACTGTT  2460
571   CAGUAGAC A CAAAACAG  133     CTGTTTTG GGCTAGCTACAACGA GTCTACTG  2461
576   GACACAAA A CAGGCUCA  134     TGAGCCTG GGCTAGCTACAACGA TTTGTGTC  2462
580   CAAAACAG G CUCAGGAC  135     GTCCTGAG GGCTAGCTACAACGA CTGTTTTG  2463
587   GGCUCAGG A CUUAGCAA  136     TTGCTAAG GGCTAGCTACAACGA CCTGAGCC  2464
592   AGGACUUA G CAAGAAGU  137     ACTTCTTG GGCTAGCTACAACGA TAAGTCCT  2465
599   AGCAAGAA G UUAUGGAA  138     TTCCATAA GGCTAGCTACAACGA TTCTTGCT  2466
602   AAGAAGUU A UGGAAUUC  139     GAATTCCA GGCTAGCTACAACGA AACTTCTT  2467
607   GUUAUGGA A UUCCUUUU  140     AAAAGGAA GGCTAGCTACAACGA TCCATAAC  2468
616   UUCCUUUU A UUGAAACA  141     TGTTTCAA GGCTAGCTACAACGA AAAAGGAA  2469
622   UUAUUGAA A CAUCAGCA  142     TGCTGATG GGCTAGCTACAACGA TTCAATAA  2470
624   AUUGAAAC A UCAGCAAA  143     TTTGCTGA GGCTAGCTACAACGA GTTTCAAT  2471
628   AAACAUCA G CAAAGACA  144     TGTCTTTG GGCTAGCTACAACGA TGATGTTT  2472
634   CAGCAAAG A CAAGACAG  145     CTGTCTTG GGCTAGCTACAACGA CTTTGCTG  2473
639   AAGACAAG A CAGGGUGU  146     ACACCCTG GGCTAGCTACAACGA CTTGTCTT  2474
644   AAGACAGG G UGUUGAUG  147     CATCAACA GGCTAGCTACAACGA CCTGTCTT  2475
646   GACAGGGU G UUGAUGAU  148     ATCATCAA GGCTAGCTACAACGA ACCCTGTC  2476
650   GGGUGUUG A UGAUGCCU  149     AGGCATCA GGCTAGCTACAACGA CAACACCC  2477
653   UGUUGAUG A UGCCUUCU  150     AGAAGGCA GGCTAGCTACAACGA CATCAACA  2478
655   UUGAUGAU G CCUUCUAU  151     ATAGAAGG GGCTAGCTACAACGA ATCATCAA  2479
662   UGCCUUCU A UACAUUAG  152     CTAATGTA GGCTAGCTACAACGA AGAAGGCA  2480




WO 02/097114 



PCT/US02/16840 






664   CCUUCUAU A CAUUAGUU  153     AACTAATG GGCTAGCTACAACGA ATAGAAGG  2481
666   UUCUAUAC A UUAGUUCG  154     CGAACTAA GGCTAGCTACAACGA GTATAGAA  2482
670   AUACAUUA G UUCGAGAA  155     TTCTCGAA GGCTAGCTACAACGA TAATGTAT  2483
679   UUCGAGAA A UUCGAAAA  156     TTTTCGAA GGCTAGCTACAACGA TTCTCGAA  2484
687   AUUCGAAA A CAUAAAGA  157     TCTTTATG GGCTAGCTACAACGA TTTCGAAT  2485
689   UCGAAAAC A UAAAGAAA  158     TTTCTTTA GGCTAGCTACAACGA GTTTTCGA  2486
700   AAGAAAAG A UGAGCAAA  159     TTTGCTCA GGCTAGCTACAACGA CTTTTCTT  2487
704   AAAGAUGA G CAAAGAUG  160     CATCTTTG GGCTAGCTACAACGA TCATCTTT  2488
710   GAGCAAAG A UGGUAAAA  161     TTTTACCA GGCTAGCTACAACGA CTTTGCTC  2489
713   CAAAGAUG G UAAAAAGA  162     TCTTTTTA GGCTAGCTACAACGA CATCTTTG  2490
732   AAAAAGAA G UCAAAGAC  163     GTCTTTGA GGCTAGCTACAACGA TTCTTTTT  2491
739   AGUCAAAG A CAAAGUGU  164     ACACTTTG GGCTAGCTACAACGA CTTTGACT  2492
744   AAGACAAA G UGUGUAAU  165     ATTACACA GGCTAGCTACAACGA TTTGTCTT  2493
746   GACAAAGU G UGUAAUUA  166     TAATTACA GGCTAGCTACAACGA ACTTTGTC  2494
748   CAAAGUGU G UAAUUAUG  167     CATAATTA GGCTAGCTACAACGA ACACTTTG  2495
751   AGUGUGUA A UUAUGUAA  168     TTACATAA GGCTAGCTACAACGA TACACACT  2496
754   GUGUAAUU A UGUAAAUA  169     TATTTACA GGCTAGCTACAACGA AATTACAC  2497
756   GUAAUUAU G UAAAUACA  170     TGTATTTA GGCTAGCTACAACGA ATAATTAC  2498
760   UUAUGUAA A UACAAUUU  171     AAATTGTA GGCTAGCTACAACGA TTACATAA  2499
762   AUGUAAAU A CAAUUUGU  172     ACAAATTG GGCTAGCTACAACGA ATTTACAT  2500
765   UAAAUACA A UUUGUACU  173     AGTACAAA GGCTAGCTACAACGA TGTATTTA  2501
769   UACAAUUU G UACUUUUU  174     AAAAAGTA GGCTAGCTACAACGA AAATTGTA  2502
771   CAAUUUGU A CUUUUUUC  175     GAAAAAAG GGCTAGCTACAACGA ACAAATTG  2503
785   UUCUUAAG G CAUACUAG  176     CTAGTATG GGCTAGCTACAACGA CTTAAGAA  2504
787   CUUAAGGC A UACUAGUA  177     TACTAGTA GGCTAGCTACAACGA GCCTTAAG  2505
789   UAAGGCAU A CUAGUACA  178     TGTACTAG GGCTAGCTACAACGA ATGCCTTA  2506
793   GCAUACUA G UACAAGUG  179     CACTTGTA GGCTAGCTACAACGA TAGTATGC  2507
795   AUACUAGU A CAAGUGGU  180     ACCACTTG GGCTAGCTACAACGA ACTAGTAT  2508
799   UAGUACAA G UGGUAAUU  181     AATTACCA GGCTAGCTACAACGA TTGTACTA  2509
802   UACAAGUG G UAAUUUUU  182     AAAAATTA GGCTAGCTACAACGA CACTTGTA  2510
805   AAGUGGUA A UUUUUGUA  183     TACAAAAA GGCTAGCTACAACGA TACCACTT  2511
811   UAAUUUUU G UACAUUAC  184     GTAATGTA GGCTAGCTACAACGA AAAAATTA  2512
813   AUUUUUGU A CAUUACAC  185     GTGTAATG GGCTAGCTACAACGA ACAAAAAT  2513
815   UUUUGUAC A UUACACUA  186     TAGTGTAA GGCTAGCTACAACGA GTACAAAA  2514
818   UGUACAUU A CACUAAAU  187     ATTTAGTG GGCTAGCTACAACGA AATGTACA  2515
820   UACAUUAC A CUAAAUUA  188     TAATTTAG GGCTAGCTACAACGA GTAATGTA  2516
825   UACACUAA A UUAUUAGC  189     GCTAATAA GGCTAGCTACAACGA TTAGTGTA  2517
828   ACUAAAUU A UUAGCAUU  190     AATGCTAA GGCTAGCTACAACGA AATTTAGT  2518
832   AAUUAUUA G CAUUUGUU  191     AACAAATG GGCTAGCTACAACGA TAATAATT  2519
834   UUAUUAGC A UUUGUUUU  192     AAAACAAA GGCTAGCTACAACGA GCTAATAA  2520
838   UAGCAUUU G UUUUAGCA  193     TGCTAAAA GGCTAGCTACAACGA AAATGCTA  2521
844   UUGUUUUA G CAUUACCU  194     AGGTAATG GGCTAGCTACAACGA TAAAACAA  2522
846   GUUUUAGC A UUACCUAA  195     TTAGGTAA GGCTAGCTACAACGA GCTAAAAC  2523
849   UUAGCAUU A CCUAAUUU  196     AAATTAGG GGCTAGCTACAACGA AATGCTAA  2524
854   AUUACCUA A UUUUUUUC  197     GAAAAAAA GGCTAGCTACAACGA TAGGTAAT  2525
865   UUUUUCCU G CUCCAUGC  198     GCATGGAG GGCTAGCTACAACGA AGGAAAAA  2526
870   CCUGCUCC A UGCAGACU  199     AGTCTGCA GGCTAGCTACAACGA GGAGCAGG  2527
872   UGCUCCAU G CAGACUGU  200     ACAGTCTG GGCTAGCTACAACGA ATGGAGCA  2528
876   CCAUGCAG A CUGUUAGC  201     GCTAACAG GGCTAGCTACAACGA CTGCATGG  2529
879   UGCAGACU G UUAGCUUU  202     AAAGCTAA GGCTAGCTACAACGA AGTCTGCA  2530
883   GACUGUUA G CUUUUACC  203     GGTAAAAG GGCTAGCTACAACGA TAACAGTC  2531
889   UAGCUUUU A CCUUAAAU  204     ATTTAAGG GGCTAGCTACAACGA AAAAGCTA  2532
896   UACCUUAA A UGCUUAUU  205     AATAAGCA GGCTAGCTACAACGA TTAAGGTA  2533
898   CCUUAAAU G CUUAUUUU  206     AAAATAAG GGCTAGCTACAACGA ATTTAAGG  2534
902   AAAUGCUU A UUUUAAAA  207     TTTTAAAA GGCTAGCTACAACGA AAGCATTT  2535
910   AUUUUAAA A UGACAGUG  208     CACTGTCA GGCTAGCTACAACGA TTTAAAAT  2536
913   UUAAAAUG A CAGUGGAA  209     TTCCACTG GGCTAGCTACAACGA CATTTTAA  2537
916   AAAUGACA G UGGAAGUU  210     AACTTCCA GGCTAGCTACAACGA TGTCATTT  2538
922   CAGUGGAA G UUUUUUUU  211     AAAAAAAA GGCTAGCTACAACGA TTCCACTG  2539
939   UCCUCGAA G UGCCAGUA  212     TACTGGCA GGCTAGCTACAACGA TTCGAGGA  2540
941   CUCGAAGU G CCAGUAUU  213     AATACTGG GGCTAGCTACAACGA ACTTCGAG  2541
945   AAGUGCCA G UAUUCCCA  214     TGGGAATA GGCTAGCTACAACGA TGGCACTT  2542
947   GUGCCAGU A UUCCCAGA  215     TCTGGGAA GGCTAGCTACAACGA ACTGGCAC  2543
956   UUCCCAGA G UUUUGGUU  216     AACCAAAA GGCTAGCTACAACGA TCTGGGAA  2544
962   GAGUUUUG G UUUUUGAA  217     TTCAAAAA GGCTAGCTACAACGA CAAAACTC  2545
970   GUUUUUGA A CUAGCAAU  218     ATTGCTAG GGCTAGCTACAACGA TCAAAAAC  2546
974   UUGAACUA G CAAUGCCU  219     AGGCATTG GGCTAGCTACAACGA TAGTTCAA  2547
977   AACUAGCA A UGCCUGUG  220     CACAGGCA GGCTAGCTACAACGA TGCTAGTT  2548
979   CUAGCAAU G CCUGUGAA  221     TTCACAGG GGCTAGCTACAACGA ATTGCTAG  2549
983   CAAUGCCU G UGAAAAAG  222     CTTTTTCA GGCTAGCTACAACGA AGGCATTG  2550
994   AAAAAGAA A CUGAAUAC  223     GTATTCAG GGCTAGCTACAACGA TTCTTTTT  2551
999   GAAACUGA A UACCUAAG  224     CTTAGGTA GGCTAGCTACAACGA TCAGTTTC  2552
1001  AACUGAAU A CCUAAGAU  225     ATCTTAGG GGCTAGCTACAACGA ATTCAGTT  2553
1008  UACCUAAG A UUUCUGUC  226     GACAGAAA GGCTAGCTACAACGA CTTAGGTA  2554
1014  AGAUUUCU G UCUUGGGG  227     CCCCAAGA GGCTAGCTACAACGA AGAAATCT  2555
1022  GUCUUGGG G UUUUUGGU  228     ACCAAAAA GGCTAGCTACAACGA CCCAAGAC  2556
1029  GGUUUUUG G UGCAUGCA  229     TGCATGCA GGCTAGCTACAACGA CAAAAACC  2557
1031  UUUUUGGU G CAUGCAGU  230     ACTGCATG GGCTAGCTACAACGA ACCAAAAA  2558
1033  UUUGGUGC A UGCAGUUG  231     CAACTGCA GGCTAGCTACAACGA GCACCAAA  2559
1035  UGGUGCAU G CAGUUGAU  232     ATCAACTG GGCTAGCTACAACGA ATGCACCA  2560
1038  UGCAUGCA G UUGAUUAC  233     GTAATCAA GGCTAGCTACAACGA TGCATGCA  2561
1042  UGCAGUUG A UUACUUCU  234     AGAAGTAA GGCTAGCTACAACGA CAACTGCA  2562
1045  AGUUGAUU A CUUCUUAU  235     ATAAGAAG GGCTAGCTACAACGA AATCAACT  2563
1052  UACUUCUU A UUUUUCUU  236     AAGAAAAA GGCTAGCTACAACGA AAGAAGTA  2564
1061  UUUUUCUU A CCAAGUGU  237     ACACTTGG GGCTAGCTACAACGA AAGAAAAA  2565
1066  CUUACCAA G UGUGAAUG  238     CATTCACA GGCTAGCTACAACGA TTGGTAAG  2566
1068  UACCAAGU G UGAAUGUU  239     AACATTCA GGCTAGCTACAACGA ACTTGGTA  2567
1072  AAGUGUGA A UGUUGGUG  240     CACCAACA GGCTAGCTACAACGA TCACACTT  2568
1074  GUGUGAAU G UUGGUGUG  241     CACACCAA GGCTAGCTACAACGA ATTCACAC  2569
1078  GAAUGUUG G UGUGAAAC  242     GTTTCACA GGCTAGCTACAACGA CAACATTC  2570
1080  AUGUUGGU G UGAAACAA  243     TTGTTTCA GGCTAGCTACAACGA ACCAACAT  2571
1085  GGUGUGAA A CAAAUUAA  244     TTAATTTG GGCTAGCTACAACGA TTCACACC  2572
1089  UGAAACAA A UUAAUGAA  245     TTCATTAA GGCTAGCTACAACGA TTGTTTCA  2573
1093  ACAAAUUA A UGAAGCUU  246     AAGCTTCA GGCTAGCTACAACGA TAATTTGT  2574
1098  UUAAUGAA G CUUUUGAA  247     TTCAAAAG GGCTAGCTACAACGA TTCATTAA  2575
1106 
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Av3ijijAJ.(aA GGCTAGCTACAACGA TCAAAAGC 


2576 


1109 


UUUGAAUC A UCCCUAUU 


249 


AATAGGGA GGCTAGCTACAACGA GATTCAAA 


2577 


1115 


UCAUCCCU A UUCUGUGU 


250 


ACACAGAA GGCTAGCTACAACGA AGGGATGA 


2578 


1120 


CCUAUUCU G UGUUUUAU 


251 


ATAAAACA GGCTAGCTACAACGA AGAATAGG 


2579 


1122 


UAUUCUGU G UUUUAUCU 


252 


AGATAAAA GGCTAGCTACAACGA ACAGAATA 


2580 


1127 


UGUGUUUU A UCUAGUCA 


253 


TGACTAGA GGCTAGCTACAACGA AAAACACA 


2581 


1132 


UUUAUCUA G UCACAUAA 


254 


TTATGTGA GGCTAGCTACAACGA TAGATAAA 


2582 


1135 


AUCUAGUC A CAUAAAUG 


255 


CATTTATG GGCTAGCTACAACGA GACTAGAT 


2583 


1137 


CUAGUCAC A UAAAUGGA 


256 


TCCATTTA GGCTAGCTACAACGA GTGACTAG 


2584 
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1141 


UCACAUAA A UGGAUUAA 


257 


TTAATCCA GGCTAGCTACAACGA TTATGTGA 




1145 


AUAAAUGG A UUAAUUAC 


258 


GTAATTAA GGCTAGCTACAACGA CCATT1A1 


9586 1 


1149 


AUGGAUUA A UUACUAAU 


259 


ATTAGTAA GGCTAGCTACAACGA TAATCCAT 
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1152 


GAUUAAUU A CUAAUUUC 


260 


GAAATTAG GGCTAGCTACAACGA AATTAAUL. 


2588 


1156 


AAUUACUA A UUUCAGUU 


261 


AACTGAAA GGCTAGCTACAACGA TAGlAAii 


2589 1 


1162 


UAAUUUCA G UUGAGACC 


262 
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GGTCTCAA GGCTAGCTACAACGA TGAAAliA 




1168 


CAGUUGAG A CCUUCUAA 


263 


TTAGAAGG GGCTAGCTACAACGA CTCAACTG 


2591 1 


1176 


ACCUUCUA A UUGGUUUU 


264 
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AAAACCAA GGCTAGCTACAACGA TAGAAGG1 


9592 1 


1180 


UCUAAUUG G UUUUUACU 


265 


AGTAAAAA GGCTAGCTACAACGA CAATTAGA 


2593 I 


1186 


UGGUUUUU A CUGAAACA 


266 


TGTTTCAG GGCTAGCTACAACGA AAAAACCA 


9 C.Q4 1 


1192 


UUACUGAA A CAUUGAGG 


267 


CCTCAATG GGCTAGCTACAACGA TTCAGTAA 


9595 1 


1194 


ACUGAAAC A UUGAGGGA 


268 


TCCCTCAA GGCTAGCTACAACGA GTTlLAbl 


9596 1 


1202 


AUUGAGGG A CACAAAUU 


269 
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AATTTGTG GGCTAGCTACAACGA CCCTCAAl 


2597 I 


1204 


UGAGGGAC A CAAAUUUA 


270 


TAAATTTG GGCTAGCTACAACGA GTC<-C1v_a 


2598 1 


1208 


GGACACAA A UUUAUGGG 


271 


CCCATAAA GGCTAGCTACAACGA TTGTCalUU j 


2599 1 


1212 


ACAAAUUU A UGGGCUUC 


272 


GAAGCCCA GGCTAGCTACAACGA AAATTTG1 


9^nn 1 


1216 


AUUUAUGG G CUUCCUGA 


273 


TCAGGAAG GGCTAGCTACAACGA CCATAAAT 


9£fl1 1 


1224 


GCUUCCUG A UGAUGAUU 


274 


AATCATCA GGCTAGCTACAACGA CAGGAAGC 


9^09 I 


1227 


UCCUGAUG A UGAUUCUU 


275 


AAGAATCA GGCTAGCTACAACGA CATCAGGA 


9^m. 1 

_2uUJ 1 


1230 


UGAUGAUG A UUCUUCUA 


276 


TAGAAGAA GGCTAGCTACAACGA CATCATCA 


9 £04. ^ 


1240 


UCUUCUAG G CAUCAUGU 


277 


ACATGATG GGCTAGCTACAACGA CTAGAAGA 


nrn^ 1 
<sOU3 j 


1242 


UUCUAGGC A UCAUGUCC 


278 


GGACATGA GGCTAGCTACAACGA GCCTAGAA 


2606


1245 


UAGGCAUC A UGUCCUAU 


279 


ATAGGACA GGCTAGCTACAACGA GATGCCTA 


2607


1247 


GGCAUCAU G UCCUAUAG 


280 


CTATAGGA GGCTAGCTACAACGA ATGATGCC 


2608


1252 


CAUGUCCU A UAGUUUGU 


281 


ACAAACTA GGCTAGCTACAACGA AGGACATG 


2609


1255 


GUCCUAUA G UUUGUCAU 


282
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ATGACAAA GGCTAGCTACAACGA TATAGGAC


2610


1259 


UAUAGUUU G UCAUCCCU 


283
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AGGGATGA GGCTAGCTACAACGA AAACTATA


2611


1262 


AGUUUGUC A UCCCUGAU 


284 


ATCAGGGA GGCTAGCTACAACGA GACAAACT 


2612


1269 


CAUCCCUG A UGAAUGUA 


285 


TACATTCA GGCTAGCTACAACGA CAGGGATG


2613


1273 


CCUGAUGA A UGUAAAGU 


286 
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599 


TTCTGATG GGCTAGCTACAACGA GACTCAGT 


2927 


2697 


UGAGUCAC A UCAGAAAU 


600 


ATTTCTGA GGCTAGCTACAACGA GTGACTCA 


2928 


2704 


CAUCAGAA A UGCCCUAC 


601 


GTAGGGCA GGCTAGCTACAACGA TTCTGATG 


2929 


2706 


UCAGAAAU G CCCUACAU 


602 


ATGTAGGG GGCTAGCTACAACGA ATTTCTGA 


2930 


2711 


AAUGCCCU A CAUCUUAU 


603 


ATAAGATG GGCTAGCTACAACGA AGGGCATT 


2931 


2713 


UGCCCUAC A UCUUAUUU 


604 


AAATAAGA GGCTAGCTACAACGA GTAGGGCA 


2932 


2718 


UACAUCUU A UUUUCCUC 


605 


GAGGAAAA GGCTAGCTACAACGA AAGATGTA 


2933 


2730 


UCCUCAGG G CUCAAGAG 


606 


CTCTTGAG GGCTAGCTACAACGA CCTGAGGA 


2934 


2740 


UCAAGAGA A UCUGACAG 


607 


CTGTCAGA GGCTAGCTACAACGA TCTCTTGA 


2935 


2745 


AGAAUCUG A CAGAUACC 


608 


GGTATCTG GGCTAGCTACAACGA CAGATTCT 


2936 


2749 


UCUGACAG A UACCAUAA 


609 


TTATGGTA GGCTAGCTACAACGA CTGTCAGA


2937 


2751 


UGACAGAU A CCAUAAAG 


610 


CTTTATGG GGCTAGCTACAACGA ATCTGTCA


2938 


2754 


CAGAUACC A UAAAGGGA 


611 


TCCCTTTA GGCTAGCTACAACGA GGTATCTG 


2939 


2762 


AUAAAGGG A UUUGACCU 


612 


AGGTCAAA GGCTAGCTACAACGA CCCTTTAT


2940

2767 


GGGAUUUG A CCUAAUCA 


613 


TGATTAGG GGCTAGCTACAACGA CAAATCCC 


2941 


2772 


UUGACCUA A UCACUAAU 


614 


ATTAGTGA GGCTAGCTACAACGA TAGGTCAA 


2942 


2775 


ACCUAAUC A CUAAUUUU 


615 


AAAATTAG GGCTAGCTACAACGA GATTAGGT 


2943 


2779 


AAUCACUA A UUUUCAGG 


616 


CCTGAAAA GGCTAGCTACAACGA TAGTGATT 


2944 


2787 


AUUUUCAG G UGGUGGCU 


617 


AGCCACCA GGCTAGCTACAACGA CTGAAAAT 


2945 


2790 


UUCAGGUG G UGGCUGAU 


618 


ATCAGCCA GGCTAGCTACAACGA CACCTGAA 


2946 


2793 


AGGUGGUG G CUGAUGCU 


619 


AGCATCAG GGCTAGCTACAACGA CACCACCT 


2947 


2797 


GGUGGCUG A UGCUUUGA 


620 


TCAAAGCA GGCTAGCTACAACGA CAGCCACC 


2948 
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2799 


UGGCUGAU G CUUUGAAC


621 


GTTCAAAG GGCTAGCTACAACGA ATCAGCCA 


2949 


2806 


UGCUUUGA A CAUCUCUU 


622 


AAGAGATG GGCTAGCTACAACGA TCAAAGCA 


2950 


2808 


CUUUGAAC A UCUCUUUG 


623 


CAAAGAGA GGCTAGCTACAACGA GTTCAAAG 


2951 


2816 


AUCUCUUU G CUGCCCAA 


624 


TTGGGCAG GGCTAGCTACAACGA AAAGAGAT 


2952 


2819


UCUUUGCU G CCCAAUCC 


625 


GGATTGGG GGCTAGCTACAACGA AGCAAAGA 


2953 


2824 


GCUGCCCA A UCCAUUAG 


626 


CTAATGGA GGCTAGCTACAACGA TGGGCAGC 


2954 


2828 


CCCAAUCC A UUAGCGAC 


627 


GTCGCTAA GGCTAGCTACAACGA GGATTGGG 


2955 


2832


AUCCAUUA G CGACAGUA 


628 


TACTGTCG GGCTAGCTACAACGA TAATGGAT 


2956 


2835 


CAUUAGCG A CAGUAGGA 


629 


TCCTACTG GGCTAGCTACAACGA CGCTAATG 


2957 


2838


UAGCGACA G UAGGAUUU 


630 


AAATCCTA GGCTAGCTACAACGA TGTCGCTA 


2958 


2843 


ACAGUAGG A UUUUUCAA 


631 


TTGAAAAA GGCTAGCTACAACGA CCTACTGT 


2959 


2851


AUUUUUCA A CCCUGGUA 


632 


TACCAGGG GGCTAGCTACAACGA TGAAAAAT 


2960 


2857


CAACCCUG G UAUGAAUA 


633 


TATTCATA GGCTAGCTACAACGA CAGGGTTG 


2961 


2859 


ACCCUGGU A UGAAUAGA 


634 


TCTATTCA GGCTAGCTACAACGA ACCAGGGT 


2962 


2863


UGGUAUGA A UAGACAGA 


635 


TCTGTCTA GGCTAGCTACAACGA TCATACCA 


2963 


2867


AUGAAUAG A CAGAACCC 


636 


GGGTTCTG GGCTAGCTACAACGA CTATTCAT 


2964 


2872 


UAGACAGA A CCCUAUCC 


637 


GGATAGGG GGCTAGCTACAACGA TCTGTCTA 


2965 


2877 


AGAACCCU A UCCAGUGG 


638 


CCACTGGA GGCTAGCTACAACGA AGGGTTCT 


2966 


2882 


CCUAUCCA G UGGAAGGA 


639 


TCCTTCCA GGCTAGCTACAACGA TGGATAGG 


2967 


2893 


GAAGGAGA A UUUAAUAA 


640 


TTATTAAA GGCTAGCTACAACGA TCTCCTTC 


2968 


2898 


AGAAUUUA A UAAAGAUA 


641 


TATCTTTA GGCTAGCTACAACGA TAAATTCT 


2969 


2904 


UAAUAAAG A UAGUGCAG 


642 


CTGCACTA GGCTAGCTACAACGA CTTTATTA 


2970 


2907 


UAAAGAUA G UGCAGAAA 


643 


TTTCTGCA GGCTAGCTACAACGA TATCTTTA 


2971 


2909 


AAGAUAGU G CAGAAAGA 


644 


TCTTTCTG GGCTAGCTACAACGA ACTATCTT 


2972 


2918 


CAGAAAGA A UUCCUUAG 


645 


CTAAGGAA GGCTAGCTACAACGA TCTTTCTG 


2973 


2927 


UUCCUUAG G UAAUCUAU 


646 


ATAGATTA GGCTAGCTACAACGA CTAAGGAA 


2974 


2930 


CUUAGGUA A UCUAUAAC 


647 


GTTATAGA GGCTAGCTACAACGA TACCTAAG 


2975 


2934 


GGUAAUCU A UAACUAGG 


648 


CCTAGTTA GGCTAGCTACAACGA AGATTACC 


2976 


2937 


AAUCUAUA A CUAGGACU 


649 


AGTCCTAG GGCTAGCTACAACGA TATAGATT 


2977 


2943 


UAACUAGG A CUACUCCU 


650 


AGGAGTAG GGCTAGCTACAACGA CCTAGTTA 


2978 


2946 


CUAGGACU A CUCCUGGU 


651 


ACCAGGAG GGCTAGCTACAACGA AGTCCTAG 


2979 


2953 


UACUCCUG G UAACAGUA 


652 


TACTGTTA GGCTAGCTACAACGA CAGGAGTA 


2980 


2956 


UCCUGGUA A CAGUAAUA 


653 


TATTACTG GGCTAGCTACAACGA TACCAGGA 


2981 


2959


UGGUAACA G UAAUACAU 


654 


ATGTATTA GGCTAGCTACAACGA TGTTACCA 


2982 


2962 


UAACAGUA A UACAUUCC 


655 


GGAATGTA GGCTAGCTACAACGA TACTGTTA 


2983 


2964 


ACAGUAAU A CAUUCCAU 


656 


ATGGAATG GGCTAGCTACAACGA ATTACTGT 


2984 


2966 


AGUAAUAC A UUCCAUUG 


657 


CAATGGAA GGCTAGCTACAACGA GTATTACT 


2985 


2971 


UACAUUCC A UUGUUUUA 


658 


TAAAACAA GGCTAGCTACAACGA GGAATGTA 


2986 


2974 


AUUCCAUU G UUUUAGUA 


659 


TACTAAAA GGCTAGCTACAACGA AATGGAAT 


2987 


2980 


UUGUUUUA G UAACCAGA 


660 


TCTGGTTA GGCTAGCTACAACGA TAAAACAA 


2988 


2983 


UUUUAGUA A CCAGAAAU 


661 


ATTTCTGG GGCTAGCTACAACGA TACTAAAA 


2989 


2990 


AACCAGAA A UCUUCAUG 


662 


CATGAAGA GGCTAGCTACAACGA TTCTGGTT 


2990 


2996 


AAAUCUUC A UGCAAUGA 


663 


TCATTGCA GGCTAGCTACAACGA GAAGATTT 


2991 


2998 


AUCUUCAU G CAAUGAAA 


664 


TTTCATTG GGCTAGCTACAACGA ATGAAGAT 


2992 


3001 


UUCAUGCA A UGAAAAAU 


665 


ATTTTTCA GGCTAGCTACAACGA TGCATGAA 


2993 


3008 


AAUGAAAA A UACUUUAA 


666 


TTAAAGTA GGCTAGCTACAACGA TTTTCATT 


2994 


3010 


UGAAAAAU A CUUUAAUU 


667 


AATTAAAG GGCTAGCTACAACGA ATTTTTCA 


2995 


3016 


AUACUUUA A UUCAUGAA 


668 


TTCATGAA GGCTAGCTACAACGA TAAAGTAT 


2996 


3020 


UUUAAUUC A UGAAGCUU 


669 


AAGCTTCA GGCTAGCTACAACGA GAATTAAA


2997 


3025 


UUCAUGAA G CUUACUUU 


670 


AAAGTAAG GGCTAGCTACAACGA TTCATGAA 


2998 


3029 


UGAAGCUU A CUUUUUUU 


671 


AAAAAAAG GGCTAGCTACAACGA AAGCTTCA 


2999 


3044 


UUUUUUUG G UGUCAGAG 


672 


CTCTGACA GGCTAGCTACAACGA CAAAAAAA 


3000 
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3046 


UUUUUGGU G UCAGAGUC 


673 


GACTCTGA GGCTAGCTACAACGA ACCAAAAA 


3001 


3052 


GUGUCAGA G UCUCGCUC 


674 


GAGCGAGA GGCTAGCTACAACGA TCTGACAC 


3002 


3057 


AGAGUCUC G CUCUUGUC 


675 


GACAAGAG GGCTAGCTACAACGA GAGACTCT 


3003 


3063 


UCGCUCUU G UCACCCAG 


676 


CTGGGTGA GGCTAGCTACAACGA AAGAGCGA 


3004 


3066 


CUCUUGUC A CCCAGGCU 


677 


AGCCTGGG GGCTAGCTACAACGA GACAAGAG 


3005 


3072 


UCACCCAG G CUGGAAUG 


678 


CATTCCAG GGCTAGCTACAACGA CTGGGTGA 


3006 


3078 


AGGCUGGA A UGCAGUGG 


679 


CCACTGCA GGCTAGCTACAACGA TCCAGCCT 


3007 


3080 


GCUGGAAU G CAGUGGCG 


680 


CGCCACTG GGCTAGCTACAACGA ATTCCAGC 


3008 


3083 


GGAAUGCA G UGGCGCCA 


681 


TGGCGCCA GGCTAGCTACAACGA TGCATTCC 


3009 


3086 


AUGCAGUG G CGCCAUCU 


682 


AGATGGCG GGCTAGCTACAACGA CACTGCAT 


3010 


3088 


GCAGUGGC G CCAUCUCA 


683 


TGAGATGG GGCTAGCTACAACGA GCCACTGC 


3011 


3091 


GUGGCGCC A UCUCAGCU 


684 


AGCTGAGA GGCTAGCTACAACGA GGCGCCAC 


3012 


3097 


CCAUCUCA G CUCACUGC 


685 


GCAGTGAG GGCTAGCTACAACGA TGAGATGG 


3013 


3101 


CUCAGCUC A CUGCAACC 


686 


GGTTGCAG GGCTAGCTACAACGA GAGCTGAG 


3014 


3104 


AGCUCACU G CAACCUUC 


687 


GAAGGTTG GGCTAGCTACAACGA AGTGAGCT 


3015 


3107 


UCACUGCA A CCUUCCAU 


688 


ATGGAAGG GGCTAGCTACAACGA TGCAGTGA 


3016 


3114 


AACCUUCC A UCUUCCCA 


689 


TGGGAAGA GGCTAGCTACAACGA GGAAGGTT 


3017 


3124 


CUUCCCAG G UUCAAGCG 


690 


CGCTTGAA GGCTAGCTACAACGA CTGGGAAG 


3018 


3130 


AGGUUCAA G CGAUUCUC 


691 


GAGAATCG GGCTAGCTACAACGA TTGAACCT 


3019 


3133 


UUCAAGCG A UUCUCGUG 


692 


CACGAGAA GGCTAGCTACAACGA CGCTTGAA 


3020 


3139 


CGAUUCUC G UGCCUCGG 


693 


CCGAGGCA GGCTAGCTACAACGA GAGAATCG 


3021 


3141 


AUUCUCGU G CCUCGGCC 


694 


GGCCGAGG GGCTAGCTACAACGA ACGAGAAT 


3022 


3147 


GUGCCUCG G CCUCCUGA 


695 


TCAGGAGG GGCTAGCTACAACGA CGAGGCAC 


3023


3156 


CCUCCUGA G UAGCUGGG 


696 


CCCAGCTA GGCTAGCTACAACGA TCAGGAGG 


3024 


3159 


CCUGAGUA G CUGGGAUU 


697 


AATCCCAG GGCTAGCTACAACGA TACTCAGG 


3025 


3165 


UAGCUGGG A UUACAGGC 


698 


GCCTGTAA GGCTAGCTACAACGA CCCAGCTA 


3026 


3168 


CUGGGAUU A CAGGCGUG 


699 


CACGCCTG GGCTAGCTACAACGA AATCCCAG 


3027 


3172 


GAUUACAG G CGUGUGCA 


700 


TGCACACG GGCTAGCTACAACGA CTGTAATC 


3028 


3174 


UUACAGGC G UGUGCACU 


701 


AGTGCACA GGCTAGCTACAACGA GCCTGTAA 


3029 


3176 


ACAGGCGU G UGCACUAC 


702 


GTAGTGCA GGCTAGCTACAACGA ACGCCTGT 


3030 


3178 


AGGCGUGU G CACUACAC 


703 


GTGTAGTG GGCTAGCTACAACGA ACACGCCT 


3031 


3180 


GCGUGUGC A CUACACUC 


704 


GAGTGTAG GGCTAGCTACAACGA GCACACGC 


3032 


3183 


UGUGCACU A CACUCAAC 


705 


GTTGAGTG GGCTAGCTACAACGA AGTGCACA 


3033 


3185 


UGCACUAC A CUCAACUA 


706 


TAGTTGAG GGCTAGCTACAACGA GTAGTGCA 


3034 


3190 


UACACUCA A CUAAUUUU 


707 


AAAATTAG GGCTAGCTACAACGA TGAGTGTA 


3035 


3194 


CUCAACUA A UUUUUGUA 


708 


TACAAAAA GGCTAGCTACAACGA TAGTTGAG 


3036 


3200 


UAAUUUUU G UAUUUUUA 


709 


TAAAAATA GGCTAGCTACAACGA AAAAATTA 


3037 


3202 


AUUUUUGU A UUUUUAGG 


710 


CCTAAAAA GGCTAGCTACAACGA ACAAAAAT 


3038 


3215 


UAGGAGAG A CGGGGUUU 


711 


AAACCCCG GGCTAGCTACAACGA CTCTCCTA 


3039 


3220 


GAGACGGG G UUUCACCU 


712 


AGGTGAAA GGCTAGCTACAACGA CCCGTCTC 


3040 


3225 


GGGGUUUC A CCUGUUGG 


713 


CCAACAGG GGCTAGCTACAACGA GAAACCCC 


3041 


3229 


UUUCACCU G UUGGCCAG 


714 


CTGGCCAA GGCTAGCTACAACGA AGGTGAAA 


3042 


3233 


ACCUGUUG G CCAGGCUG 


715 


CAGCCTGG GGCTAGCTACAACGA CAACAGGT 


3043 


3238 


UUGGCCAG G CUGGUCUC 


716 


GAGACCAG GGCTAGCTACAACGA CTGGCCAA


3044

3242 


CCAGGCUG G UCUCGAAC 


717 


GTTCGAGA GGCTAGCTACAACGA CAGCCTGG 


3045 


3249 


GGUCUCGA A CUCCUGAC 


718


GTCAGGAG GGCTAGCTACAACGA TCGAGACC 


3046 


3256 


AACUCCUG A CCUCAAGU 


719 


ACTTGAGG GGCTAGCTACAACGA CAGGAGTT 


3047 


3263 


GACCUCAA G UGAUUCAC 


720 


GTGAATCA GGCTAGCTACAACGA TTGAGGTC 


3048 


3266 


CUCAAGUG A UUCACCCA 


721 


TGGGTGAA GGCTAGCTACAACGA CACTTGAG 


3049 


3270 


AGUGAUUC A CCCACCUU 


722 


AAGGTGGG GGCTAGCTACAACGA GAATCACT 


3050 


3274 


AUUCACCC A CCUUGGCC 


723 


GGCCAAGG GGCTAGCTACAACGA GGGTGAAT 


3051 


3280 


CCACCUUG G CCUCAUAA 


724 


TTATGAGG GGCTAGCTACAACGA CAAGGTGG 


3052 
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3285 


UUGGCCUC A UAAACCUG


725


CAGGTTTA GGCTAGCTACAACGA GAGGCCAA


3053 


3289 


CCUCAUAA A CCUGUUUU


726


AAAACAGG GGCTAGCTACAACGA TTATGAGG


3054 


3293 


AUAAACCU G UUUUGCAG


727


CTGCAAAA GGCTAGCTACAACGA AGGTTTAT


3055 


3298 


CCUGUUUU G CAGAACUC


728


GAGTTCTG GGCTAGCTACAACGA AAAACAGG


3056 


3303 


UUUGCAGA A CUCAUUUA


729


TAAATGAG GGCTAGCTACAACGA TCTGCAAA


3057 


3307 


CAGAACUC A UUUAUUCA


730


TGAATAAA GGCTAGCTACAACGA GAGTTCTG


3058 


3311 


ACUCAUUU A UUCAGCAA 


731


TTGCTGAA GGCTAGCTACAACGA AAATGAGT


3059 


3316 


UUUAUUCA G CAAAUAUU


732


AATATTTG GGCTAGCTACAACGA TGAATAAA


3060 


3320 


UUCAGCAA A UAUUUAUU 


733


AATAAATA GGCTAGCTACAACGA TTGCTGAA


3061 


3322 


CAGCAAAU A UUUAUUGA


734


TCAATAAA GGCTAGCTACAACGA ATTTGCTG


3062 


3326 


AAAUAUUU A UUGAGUGC


735


GCACTCAA GGCTAGCTACAACGA AAATATTT


3063 


3331 


UUUAUUGA G UGCCUACC


736


GGTAGGCA GGCTAGCTACAACGA TCAATAAA


3064 


3333 


UAUUGAGU G CCUACCAG


737


CTGGTAGG GGCTAGCTACAACGA ACTCAATA


3065 


3337 


GAGUGCCU A CCAGAUGC


738


GCATCTGG GGCTAGCTACAACGA AGGCACTC 


3066 


3342 


CCUACCAG A UGCCAGUC


739


GACTGGCA GGCTAGCTACAACGA CTGGTAGG


3067 


3344 


UACCAGAU G CCAGUCAC


740


GTGACTGG GGCTAGCTACAACGA ATCTGGTA


3068 


3348 


AGAUGCCA G UCACCGCA


741


TGCGGTGA GGCTAGCTACAACGA TGGCATCT 


3069 


3351 


UGCCAGUC A CCGCACAA


742


TTGTGCGG GGCTAGCTACAACGA GACTGGCA 


3070 


3354 


CAGUCACC G CACAAGGC


743


GCCTTGTG GGCTAGCTACAACGA GGTGACTG 


3071 


3356 


GUCACCGC A CAAGGCAC


744


GTGCCTTG GGCTAGCTACAACGA GCGGTGAC 


3072 


3361 


CGCACAAG G CACUGGGU


745


ACCCAGTG GGCTAGCTACAACGA CTTGTGCG 


3073 


3363 


CACAAGGC A CUGGGUAU


746


ATACCCAG GGCTAGCTACAACGA GCCTTGTG 


3074 


3368 


GGCACUGG G UAUAUGGU


747


ACCATATA GGCTAGCTACAACGA CCAGTGCC 


3075 


3370 


CACUGGGU A UAUGGUAU


748


ATACCATA GGCTAGCTACAACGA ACCCAGTG 


3076 


3372 


CUGGGUAU A UGGUAUCC


749


GGATACCA GGCTAGCTACAACGA ATACCCAG 


3077 


3375 


GGUAUAUG G UAUCCCCA


750


TGGGGATA GGCTAGCTACAACGA CATATACC 


3078 


3377 


UAUAUGGU A UCCCCAAA


751


TTTGGGGA GGCTAGCTACAACGA ACCATATA 


3079 


3385 


AUCCCCAA A CAAGAGAC


752


GTCTCTTG GGCTAGCTACAACGA TTGGGGAT


3080 


3392 


AACAAGAG A CAUAAUCC


753


GGATTATG GGCTAGCTACAACGA CTCTTGTT


3081 


3394 


CAAGAGAC A UAAUCCCG 


754


CGGGATTA GGCTAGCTACAACGA GTCTCTTG 


3082 


3397 


GAGACAUA A UCCCGGUC 


755


GACCGGGA GGCTAGCTACAACGA TATGTCTC


3083 


3403 


UAAUCCCG G UCCUUAGG 


756


CCTAAGGA GGCTAGCTACAACGA CGGGATTA


3084 


3411 


GUCCUUAG G UACUGCUA 


757


TAGCAGTA GGCTAGCTACAACGA CTAAGGAC


3085 


3413 


CCUUAGGU A CUGCUAGU 


758


ACTAGCAG GGCTAGCTACAACGA ACCTAAGG


3086 


3416


UAGGUACU G CUAGUGUG 


759


CACACTAG GGCTAGCTACAACGA AGTACCTA


3087 


3420 


UACUGCUA G UGUGGUCU 


760 


AGACCACA GGCTAGCTACAACGA TAGCAGTA


3088 


3422 


CUGCUAGU G UGGUCUGU 


761


ACAGACCA GGCTAGCTACAACGA ACTAGCAG


3089 


3425


CUAGUGUG G UCUGUAAU 


762


ATTACAGA GGCTAGCTACAACGA CACACTAG


3090 


3429 


UGUGGUCU G UAAUAUCU 


763


AGATATTA GGCTAGCTACAACGA AGACCACA


3091 


3432 


GGUCUGUA A UAUCUUAC


764


GTAAGATA GGCTAGCTACAACGA TACAGACC


3092 


3434 


UCUGUAAU A UCUUACUA


765


TAGTAAGA GGCTAGCTACAACGA ATTACAGA 


3093 


3439 


AAUAUCUU A CUAAGGCC


766


GGCCTTAG GGCTAGCTACAACGA AAGATATT 


3094 


3445 


UUACUAAG G CCUUUGGU 


767


ACCAAAGG GGCTAGCTACAACGA CTTAGTAA


3095 


3452 


GGCCUUUG G UAUACGAC 


768 


GTCGTATA GGCTAGCTACAACGA CAAAGGCC


3096 


3454 


CCUUUGGU A UACGACCC 


769 


GGGTCGTA GGCTAGCTACAACGA ACCAAAGG 


3097 


3456 


UUUGGUAU A CGACCCAG 


770 


CTGGGTCG GGCTAGCTACAACGA ATACCAAA 


3098 


3459 


GGUAUACG A CCCAGAGA 


771 


TCTCTGGG GGCTAGCTACAACGA CGTATACC 


3099 


3467 


ACCCAGAG A UAACACGA 


772 


TCGTGTTA GGCTAGCTACAACGA CTCTGGGT 


3100 


3470 


CAGAGAUA A CACGAUGC 


773 


GCATCGTG GGCTAGCTACAACGA TATCTCTG 


3101 


3472 


GAGAUAAC A CGAUGCGU 


774 


ACGCATCG GGCTAGCTACAACGA GTTATCTC 


3102 


3475 


AUAACACG A UGCGUAUU 


775 


AATACGCA GGCTAGCTACAACGA CGTGTTAT 


3103 


3477 


AACACGAU G CGUAUUUU 


776 


AAAATACG GGCTAGCTACAACGA ATCGTGTT 


3104
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3479


CACGAUGC G UAUUUUAG 


777 


CTAAAATA GGCTAGCTACAACGA GCATCGTG 


3105


3481


CGAUGCGU A UUUUAGUU 


778 


AACTAAAA GGCTAGCTACAACGA ACGCATCG 


3106


3487 


GUAUUUUA G UUUUGCAA 


779 


TTGCAAAA GGCTAGCTACAACGA TAAAATAC 


3107


3492 


UUAGUUUU G CAAAGAAG 
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3548 


5376 


AUAUGGAU A CUUUAUAA 


1221 


TTATAAAG GGCTAGCTACAACGA ATCCATAT 


3549 


5381 


GAUACUUU A UAAGCCAU 


1222 


ATGGCTTA GGCTAGCTACAACGA AAAGTATC 


3550 


5385 


CUUUAUAA G CCAUAGAC 


1223 


GTCTATGG GGCTAGCTACAACGA TTATAAAG 


3551 


5388 


UAUAAGCC A UAGACACU 


1224 


AGTGTCTA GGCTAGCTACAACGA GGCTTATA 


3552 


5392 


AGCCAUAG A CACUAUAG 


1225 


CTATAGTG GGCTAGCTACAACGA CTATGGCT 


3553 


5394 


CCAUAGAC A CUAUAGUA 


1226 


TACTATAG GGCTAGCTACAACGA GTCTATGG 


3554 


5397 


UAGACACU A UAGUAUAC 


1227 


GTATACTA GGCTAGCTACAACGA AGTGTCTA 


3555 


5400 


ACACUAUA G UAUACCAG 


1228 


CTGGTATA GGCTAGCTACAACGA TATAGTGT 


3556 


5402 


ACUAUAGU A UACCAGUG 


1229 


CACTGGTA GGCTAGCTACAACGA ACTATAGT 


3557 


5404 


UAUAGUAU A CCAGUGAA 


1230 


TTCACTGG GGCTAGCTACAACGA ATACTATA 


3558 


5408 


GUAUACCA G UGAAUCUU 


1231 


AAGATTCA GGCTAGCTACAACGA TGGTATAC 


3559 


5412 


ACCAGUGA A UCUUUUAU


1232 


ATAAAAGA GGCTAGCTACAACGA TCACTGGT 


3560 


5419 


AAUCUUUU A UGCAGCUU


1233 


AAGCTGCA GGCTAGCTACAACGA AAAAGATT 


3561 


5421 


UCUUUUAU G CAGCUUGU 


1234 


ACAAGCTG GGCTAGCTACAACGA ATAAAAGA 


3562 


5424 


UUUAUGCA G CUUGUUAG 


1235 


CTAACAAG GGCTAGCTACAACGA TGCATAAA 


3563 


5428 


UGCAGCUU G UUAGAAGU


1236 


ACTTCTAA GGCTAGCTACAACGA AAGCTGCA


3564 


5435 


UGUUAGAA G UAUCCUUU 


1237 


AAAGGATA GGCTAGCTACAACGA TTCTAACA 


3565 


5437 


UUAGAAGU A UCCUUUUA 


1238 


TAAAAGGA GGCTAGCTACAACGA ACTTCTAA 


3566 


5445 


AUCCUUUU A UUUUCUAA 


1239 


TTAGAAAA GGCTAGCTACAACGA AAAAGGAT 


3567 


5457 


UCUAAAAG G UGCUGUGG 


1240 


CCACAGCA GGCTAGCTACAACGA CTTTTAGA 


3568 


5459 


UAAAAGGU G CUGUGGAU 


1241 


ATCCACAG GGCTAGCTACAACGA ACCTTTTA 


3569 


5462 


AAGGUGCU G UGGAUAUU 


1242 


AATATCCA GGCTAGCTACAACGA AGCACCTT 


3570 


5466 


UGCUGUGG A UAUUAUGU 


1243 


ACATAATA GGCTAGCTACAACGA CCACAGCA 


3571 


5468


CUGUGGAU A UUAUGUAA


1244 


TTACATAA GGCTAGCTACAACGA ATCCACAG 


3572 
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5471 


UGGAUAUU A UGUAAAGG


1245


CCTTTACA GGCTAGCTACAACGA AATATCCA 


3573 


5473 


GAUAUUAU G UAAAGGCG


1246


CGCCTTTA GGCTAGCTACAACGA ATAATATC


3574 


5479 


AUGUAAAG G CGUGUUUG


1247


CAAACACG GGCTAGCTACAACGA CTTTACAT 


3575 


5481 


GUAAAGGC G UGUUUGCU


1248




AGCAAACA GGCTAGCTACAACGA GCCTTTAC 


3576 


5483 


AAAGGCGU G UUUGCUUA


1249




TAAGCAAA GGCTAGCTACAACGA ACGCCTTT 


3577 


5487 


GCGUGUUU G CUUAAACA


1250


TGTTTAAG GGCTAGCTACAACGA AAACACGC 


3578 


5493


UUGCUUAA A CAAUUUUC


1251




GAAAATTG GGCTAGCTACAACGA TTAAGCAA 


3579 


5496


CUUAAACA A UUUUCCAU


1252 


ATGGAAAA GGCTAGCTACAACGA TGTTTAAG 


3580 




5503


AAUUUUCC A UAUUUAGA


1253 


TCTAAATA GGCTAGCTACAACGA GGAAAATT 


3581 


5505


UUUUCCAU A UUUAGAAG


1254 


CTTCTAAA GGCTAGCTACAACGA ATGGAAAA 


3582 


5513


AUUUAGAA G UAGAUGCA


1255 


TGCATCTA GGCTAGCTACAACGA TTCTAAAT 


3583 


5517


AGAAGUAG A UGCAAAAC


1256


GTTTTGCA GGCTAGCTACAACGA CTACTTCT 


3584 


5519


AAGUAGAU G CAAAACAA


1257


TTGTTTTG GGCTAGCTACAACGA ATCTACTT 


3585 


5524


GAUGCAAA A CAAAUCUG


1258 


CAGATTTG GGCTAGCTACAACGA TTTGCATC 


3586 


5528


CAAAACAA A UCUGCCUU


1259


AAGGCAGA GGCTAGCTACAACGA TTGTTTTG 


3587 


5532


ACAAAUCU G CCUUUAUG


1260


CATAAAGG GGCTAGCTACAACGA AGATTTGT 


3588 


5538 


CUGCCUUU A UGACAAAA


1261


TTTTGTCA GGCTAGCTACAACGA AAAGGCAG 


3589 


5541 


CCUUUAUG A CAAAAAAA


1262 


TTTTTTTG GGCTAGCTACAACGA CATAAAGG 


3590 




5549


ACAAAAAA A UAGGAUAA


1263 


TTATCCTA GGCTAGCTACAACGA TTTTTTGT 


3591 


5554


AAAAUAGG A UAACAUUA


1264 


TAATGTTA GGCTAGCTACAACGA CCTATTTT


3592 


5557


AUAGGAUA A CAUUAUUU


1265 


AAATAATG GGCTAGCTACAACGA TATCCTAT


3593 


5559 


AGGAUAAC A UUAUUUAU


1266 


ATAAATAA GGCTAGCTACAACGA GTTATCCT 


3594 


5562 


AUAACAUU A UUUAUUUA


1267


TAAATAAA GGCTAGCTACAACGA AATGTTAT 


3595 


5566 


CAUUAUUU A UUUAUUUC


1268 


GAAATAAA GGCTAGCTACAACGA AAATAATG 


3596 


5570 


AUUUAUUU A UUUCCUUU


1269 


AAAGGAAA GGCTAGCTACAACGA AAATAAAT 


3597 


5580 


UUCCUUUU A UCAAUAAG


1270 


CTTATTGA GGCTAGCTACAACGA AAAAGGAA 


3598 


5584 


UUUUAUCA A UAAGGUAA


1271


TTACCTTA GGCTAGCTACAACGA TGATAAAA


3599 


5589 


UCAAUAAG G UAAUUGAU


1272


ATCAATTA GGCTAGCTACAACGA CTTATTGA 


3600 


5592 


AUAAGGUA A UUGAUACA


1273


TGTATCAA GGCTAGCTACAACGA TACCTTAT 


3601 


5596 


GGUAAUUG A UACACAAC


1274


GTTGTGTA GGCTAGCTACAACGA CAATTACC 


3602 


5598 


UAAUUGAU A CACAACAG


1275


CTGTTGTG GGCTAGCTACAACGA ATCAATTA 


3603 


5600 


AUUGAUAC A CAACAGGU


1276


ACCTGTTG GGCTAGCTACAACGA GTATCAAT 


3604 


5603 


GAUACACA A CAGGUGAC


1277


GTCACCTG GGCTAGCTACAACGA TGTGTATC 


3605 


5607 


CACAACAG G UGACUUGG


1278


CCAAGTCA GGCTAGCTACAACGA CTGTTGTG 


3606 


5610 


AACAGGUG A CUUGGUUU


1279


AAACCAAG GGCTAGCTACAACGA CACCTGTT 


3607 


5615 


GUGACUUG G UUUUAGGC


1280


GCCTAAAA GGCTAGCTACAACGA CAAGTCAC


3608 


5622 


GGUUUUAG G CCCAAAGG


1281


CCTTTGGG GGCTAGCTACAACGA CTAAAACC


3609 


5630 


GCCCAAAG G UAGCAGCA


1282


TGCTGCTA GGCTAGCTACAACGA CTTTGGGC 


3610 


5633 


CAAAGGUA G CAGCAGCA


1283


TGCTGCTG GGCTAGCTACAACGA TACCTTTG 


3611 


5636 


AGGUAGCA G CAGCAACA


1284


TGTTGCTG GGCTAGCTACAACGA TGCTACCT 


3612 


5639 


UAGCAGCA G CAACAUUA


1285 


TAATGTTG GGCTAGCTACAACGA TGCTGCTA 


3613 


5642 


CAGCAGCA A CAUUAAUA


1286


TATTAATG GGCTAGCTACAACGA TGCTGCTG 


3614 


5644 


GCAGCAAC A UUAAUAAU


1287


ATTATTAA GGCTAGCTACAACGA GTTGCTGC 


3615 


5648 


CAACAUUA A UAAUGGAA


1288 


TTCCATTA GGCTAGCTACAACGA TAATGTTG


3616


5651 


CAUUAAUA A UGGAAAUA 


1289 


TATTTCCA GGCTAGCTACAACGA TATTAATG 


3617 


5657 


UAAUGGAA A UAAUUGAA 


1290 


TTCAATTA GGCTAGCTACAACGA TTCCATTA 


3618 


5660 


UGGAAAUA A UUGAAUAG 


1291 


CTATTCAA GGCTAGCTACAACGA TATTTCCA 


3619 


5665 


AUAAUUGA A UAGUUAGU 


1292 


ACTAACTA GGCTAGCTACAACGA TCAATTAT 


3620 


5668 


AUUGAAUA G UUAGUUAU 


1293 


ATAACTAA GGCTAGCTACAACGA TATTCAAT 


3621 


5672 


AAUAGUUA G UUAUGUAU 


1294 


ATACATAA GGCTAGCTACAACGA TAACTATT 


3622 


5675 


AGUUAGUU A UGUAUGUU 


1295 


AACATACA GGCTAGCTACAACGA AACTAACT 


3623 


5677 


UUAGUUAU G UAUGUUAA 


1296 


TTAACATA GGCTAGCTACAACGA ATAACTAA 


3624 
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5679 


AGUUAUGU A UGUUAAUG 


1297 


CATTAACA GGCTAGCTACAACGA ACATAACT


3625 


5681 


UUAUGUAU G UUAAUGCC 


1298 


GGCATTAA GGCTAGCTACAACGA ATACATAA


3626

5685 


GUAUGUUA A UGCCAGUC 


1299 


GACTGGCA GGCTAGCTACAACGA TAACATAC


3627


5687 


AUGUUAAU G CCAGUCAC 


1300 


GTGACTGG GGCTAGCTACAACGA ATTAACAT


3628 


5691 


UAAUGCCA G UCACCAGC 


1301 


GCTGGTGA GGCTAGCTACAACGA TGGCATTA


3629


5694 


UGCCAGUC A CCAGCAGG 


1302 


CCTGCTGG GGCTAGCTACAACGA GACTGGCA 


3630


5698 


AGUCACCA G CAGGCUAU 


1303 


ATAGCCTG GGCTAGCTACAACGA TGGTGACT


3631


5702 


ACCAGCAG G CUAUUUCA 


1304 


TGAAATAG GGCTAGCTACAACGA CTGCTGGT


3632

5705 


AGCAGGCU A UUUCAAGG 


1305 


CCTTGAAA GGCTAGCTACAACGA AGCCTGCT 


3633


5713 


AUUUCAAG G UCAGAAGU 


1306 


ACTTCTGA GGCTAGCTACAACGA CTTGAAAT


3634

5720 


GGUCAGAA G UAAUGACU 


1307 


AGTCATTA GGCTAGCTACAACGA TTCTGACC


3635


5723 


CAGAAGUA A UGACUCCA 


1308


TGGAGTCA GGCTAGCTACAACGA TACTTCTG


3636


5726 


AAGUAAUG A CUCCAUAC 


1309 


GTATGGAG GGCTAGCTACAACGA CATTACTT


3637


5731 


AUGACUCC A UACAUAUU 


1310 


AATATGTA GGCTAGCTACAACGA GGAGTCAT


3638


5733 


GACUCCAU A CAUAUUAU 


1311 


ATAATATG GGCTAGCTACAACGA ATGGAGTC


3639

5735 


CUCCAUAC A UAUUAUUU 


1312 


AAATAATA GGCTAGCTACAACGA GTATGGAG 


3640


5737 


CCAUACAU A UUAUUUAU


1313

ATAAATAA GGCTAGCTACAACGA ATGTATGG 


3641 


5740 


UACAUAUU A UUUAUUUC 


1314 


GAAATAAA GGCTAGCTACAACGA AATATGTA 


3642 


5744 


UAUUAUUU A UUUCUAUA 


1315 


TATAGAAA GGCTAGCTACAACGA AAATAATA 


3643 


5750 


UUAUUUCU A UAACUACA 


1316 


TGTAGTTA GGCTAGCTACAACGA AGAAATAA 


3644 


5753 


UUUCUAUA A CUACAUUU 


1317 


AAATGTAG GGCTAGCTACAACGA TATAGAAA 


3645 


5756 


CUAUAACU A CAUUUAAA 


1318 


TTTAAATG GGCTAGCTACAACGA AGTTATAG 


3646 


5758 


AUAACUAC A UUUAAAUC 


1319 


GATTTAAA GGCTAGCTACAACGA GTAGTTAT 


3647 


5764 


ACAUUUAA A UCAUUACC 


1320 


GGTAATGA GGCTAGCTACAACGA TTAAATGT 


3648 


5767 


UUUAAAUC A UUACCAGG 


1321 


CCTGGTAA GGCTAGCTACAACGA GATTTAAA 


3649 



Input Sequence = NM_004985. Cut Site = R/Y
Arm Length = 8. Core Sequence = GGCTAGCTACAACGA
NM_004985 (Homo sapiens v-Ki-ras2 Kirsten rat sarcoma 2 viral oncogene homolog (KRas2), mRNA; 5775 nt)
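
The design parameters in the note above (arm length 8, core sequence GGCTAGCTACAACGA, cut site R/Y) are enough to regenerate any DNAzyme row from its substrate row. The following is a minimal sketch only, not the method actually used to build the table. It assumes that each substrate is written as a 5' arm, an unpaired purine, and a 3' arm separated by spaces, that "R/Y" means a purine followed by a pyrimidine, and that each DNAzyme arm is the DNA reverse complement of the substrate arm it pairs with. The function names are hypothetical.

```python
# Sketch under stated assumptions: rebuild a DNAzyme row from a substrate row.
CORE = "GGCTAGCTACAACGA"  # core sequence given in the table note
ARM_LENGTH = 8            # arm length given in the table note

# RNA base -> complementary DNA base
_COMPLEMENT = {"A": "T", "U": "A", "G": "C", "C": "G"}


def reverse_complement_dna(rna: str) -> str:
    """Return the DNA reverse complement of an RNA string."""
    return "".join(_COMPLEMENT[base] for base in reversed(rna))


def design_dnazyme(substrate: str) -> str:
    """Build the DNAzyme for a substrate written as '5'arm purine 3'arm'."""
    left, cut, right = substrate.split()
    if len(left) != ARM_LENGTH or len(right) != ARM_LENGTH:
        raise ValueError("each binding arm must be 8 nucleotides long")
    # Assumed reading of 'Cut Site = R/Y': unpaired purine, then a pyrimidine.
    if cut not in "AG" or right[0] not in "CU":
        raise ValueError("cut site must be a purine followed by a pyrimidine")
    # The DNAzyme 5' arm pairs with the substrate 3' arm, and vice versa.
    return f"{reverse_complement_dna(right)} {CORE} {reverse_complement_dna(left)}"


if __name__ == "__main__":
    print(design_dnazyme("AUGGAUUA A UUACUAAU"))
```

Applied to the substrate AUGGAUUA A UUACUAAU (Seq ID 259), the sketch returns ATTAGTAA GGCTAGCTACAACGA TAATCCAT, which agrees with the DNAzyme listed for that row. The same relation can be used to cross-check rows where one of the two sequences is illegible.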
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Table III: Human H-Ras DNAzyme and Target molecules



Pos 


Substrate 


Seq ID


DNAzyme 


Seq ID


9


GGAUCCCA G CCUUUCCC 


1322 


GGGAAAGG GGCTAGCTACAACGA TGGGATCC 


3650 


20 


UUUCCCCA G CCCGUAGC 


1323 


GCTACGGG GGCTAGCTACAACGA TGGGGAAA 


3651 


24 


CCCAGCCC G UAGCCCCG 


1324 


CGGGGCTA GGCTAGCTACAACGA GGGCTGGG 


3652 


27


AGCCCGUA G CCCCGGGA 


1325 


TCCCGGGG GGCTAGCTACAACGA TACGGGCT 


3653 


35 


GCCCCGGG A CCUCCGCG 


1326 


CGCGGAGG GGCTAGCTACAACGA CCCGGGGC 


3654 


41 


GGACCUCC G CGGUGGGC 


1327 


GCCCACCG GGCTAGCTACAACGA GGAGGTCC 


3655 


44


CCUCCGCG G UGGGCGGC 


1328 


GCCGCCCA GGCTAGCTACAACGA CGCGGAGG 


3656 


48 


CGCGGUGG G CGGCGCCG 


1329 


CGGCGCCG GGCTAGCTACAACGA CCACCGCG 


3657 


51


GGUGGGCG G CGCCGCGC 


1330 


GCGCGGCG GGCTAGCTACAACGA CGCCCACC 


3658 


53 


UGGGCGGC G CCGCGCUG 


1331 


CAGCGCGG GGCTAGCTACAACGA GCCGCCCA 


3659 


56 


GCGGCGCC G CGCUGCCG 


1332 


CGGCAGCG GGCTAGCTACAACGA GGCGCCGC 


3660 


58 


GGCGCCGC G CUGCCGGC 


1333 


GCCGGCAG GGCTAGCTACAACGA GCGGCGCC 


3661 


61 


GCCGCGCU G CCGGCGCA 


1334 


TGCGCCGG GGCTAGCTACAACGA AGCGCGGC 


3662 


65 


CGCUGCCG G CGCAGGGA 


1335 


TCCCTGCG GGCTAGCTACAACGA CGGCAGCG 


3663 


67 


CUGCCGGC G CAGGGAGG 


1336 


CCTCCCTG GGCTAGCTACAACGA GCCGGCAG 


3664 


76 


CAGGGAGG G CCUCUGGU 


1337 


ACCAGAGG GGCTAGCTACAACGA CCTCCCTG 


3665 


83 


GGCCUCUG G UGCACCGG 


1338 


CCGGTGCA GGCTAGCTACAACGA CAGAGGCC 


3666 


85 


CCUCUGGU G CACCGGCA 


1339 


TGCCGGTG GGCTAGCTACAACGA ACCAGAGG 


3667 


87 


UCUGGUGC A CCGGCACC 


1340 


GGTGCCGG GGCTAGCTACAACGA GCACCAGA 


3668


91 


GUGCACCG G CACCGCUG 


1341 


CAGCGGTG GGCTAGCTACAACGA CGGTGCAC 


3669 


93 


GCACCGGC A CCGCUGAG 


1342 


CTCAGCGG GGCTAGCTACAACGA GCCGGTGC 


3670 


96 


CCGGCACC G CUGAGUCG 


1343 


CGACTCAG GGCTAGCTACAACGA GGTGCCGG 


3671 


101 


ACCGCUGA G UCGGGUUC 


1344 


GAACCCGA GGCTAGCTACAACGA TCAGCGGT 


3672 


106 


UGAGUCGG G UUCUCUCG 


1345 


CGAGAGAA GGCTAGCTACAACGA CCGACTCA 


3673 


114 


GUUCUCUC G CCGGCCUG 


1346 


CAGGCCGG GGCTAGCTACAACGA GAGAGAAC 


3674 


118 


UCUCGCCG G CCUGUUCC 


1347 


GGAACAGG GGCTAGCTACAACGA CGGCGAGA 


3675 


122 


GCCGGCCU G UUCCCGGG 


1348 


CCCGGGAA GGCTAGCTACAACGA AGGCCGGC 


3676 


134 


CCGGGAGA G CCCGGGGC 


1349 


GCCCCGGG GGCTAGCTACAACGA TCTCCCGG 


3677 


141 


AGCCCGGG G CCCUGCUC 


1350 


GAGCAGGG GGCTAGCTACAACGA CCCGGGCT 


3678 


146 


GGGGCCCU G CUCGGAGA 


1351 


TCTCCGAG GGCTAGCTACAACGA AGGGCCCC 


3679 


154 


GCUCGGAG A UGCCGCCC 


1352 


GGGCGGCA GGCTAGCTACAACGA CTCCGAGC 


3680 


156 


UCGGAGAU G CCGCCCCG 


1353 


CGGGGCGG GGCTAGCTACAACGA ATCTCCGA 


3681


159


GAGAUGCC G CCCCGGGC 


1354 


GCCCGGGG GGCTAGCTACAACGA GGCATCTC 


3682 


166 


CGCCCCGG G CCCCCAGA 


1355 


TCTGGGGG GGCTAGCTACAACGA CCGGGGCG 


3683 


174 


GCCCCCAG A CACCGGCU 


1356 


AGCCGGTG GGCTAGCTACAACGA CTGGGGGC 


3684 


176 


CCCCAGAC A CCGGCUCC 


1357 


GGAGCCGG GGCTAGCTACAACGA GTCTGGGG 


3685 


180 


AGACACCG G CUCCCUGG 


1358 


CCAGGGAG GGCTAGCTACAACGA CGGTGTCT 


3686 


188 


GCUCCCUG G CCUUCCUC 


1359 


GAGGAAGG GGCTAGCTACAACGA CAGGGAGC 


3687 


199 


UUCCUCGA G CAACCCCG 


1360 


CGGGGTTG GGCTAGCTACAACGA TCGAGGAA 


3688 


202 


CUCGAGCA A CCCCGAGC 


1361 


GCTCGGGG GGCTAGCTACAACGA TGCTCGAG 


3689 


209 


AACCCCGA G CUCGGCUC 


1362 


GAGCCGAG GGCTAGCTACAACGA TCGGGGTT 


3690 


214 


CGAGCUCG G CUCCGGUC 


1363 


GACCGGAG GGCTAGCTACAACGA CGAGCTCG 


3691 


220 


CGGCUCCG G UCUCCAGC 


1364 


GCTGGAGA GGCTAGCTACAACGA CGGAGCCG 


3692 


227 


GGUCUCCA G CCAAGCCC 


1365 


GGGCTTGG GGCTAGCTACAACGA TGGAGACC 


3693 


232 


CCAGCCAA G CCCAACCC 


1366 


GGGTTGGG GGCTAGCTACAACGA TTGGCTGG 


3694 


237 


CAAGCCCA A CCCCGAGA 


1367 


TCTCGGGG GGCTAGCTACAACGA TGGGCTTG 


3695 


247 


CCCGAGAG G CCGCGGCC 


1368 


GGCCGCGG GGCTAGCTACAACGA CTCTCGGG 


3696 


250 


GAGAGGCC G CGGCCCUA 


1369 


TAGGGCCG GGCTAGCTACAACGA GGCCTCTC 


3697 
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253 


AGGCCGCG G CCCUACUG 


1370 


CAGTAGGG GGCTAGCTACAACGA CGCGGCCT 


3698 


258 


GCGGCCCU A CUGGCUCC 


1371 


GGAGCCAG GGCTAGCTACAACGA AGGGCCGC 


3699 


262 


CCCUACUG G CUCCGCCU 


1372 


AGGCGGAG GGCTAGCTACAACGA CAGTAGGG 


3700 


267 


CUGGCUCC G CCUCCCGC 


1373 


GCGGGAGG GGCTAGCTACAACGA GGAGCCAG 


3701 


274 


CGCCUCCC G CGUUGCUC 


1374 


GAGCAACG GGCTAGCTACAACGA GGGAGGCG 


3702 


276 


CCUCCCGC G UUGCUCCC 


1375 


GGGAGCAA GGCTAGCTACAACGA GCGGGAGG 


3703 


279 


CCCGCGUU G CUCCCGGA 


1376 


TCCGGGAG GGCTAGCTACAACGA AACGCGGG 


3704 


289 


UCCCGGAA G CCCCGCCC 


1377 


GGGCGGGG GGCTAGCTACAACGA TTCCGGGA 


3705 


294 


GAAGCCCC G CCCGACCG 


1378 


CGGTCGGG GGCTAGCTACAACGA GGGGCTTC 


3706 


299 


CCCGCCCG A CCGCGGCU 


1379 


AGCCGCGG GGCTAGCTACAACGA CGGGCGGG 


3707 


302 


GCCCGACC G CGGCUCCU 


1380 


AGGAGCCG GGCTAGCTACAACGA GGTCGGGC 


3708 


305 


CGACCGCG G CUCCUGAC 


1381 


GTCAGGAG GGCTAGCTACAACGA CGCGGTCG 


3709 


312 


GGCUCCUG A CAGACGGG 


1382 


CCCGTCTG GGCTAGCTACAACGA CAGGAGCC 


3710 


316 


CCUGACAG A CGGGCCGC 


1383 


GCGGCCCG GGCTAGCTACAACGA CTGTCAGG 


3711 


320 


ACAGACGG G CCGCUCAG 


1384 


CTGAGCGG GGCTAGCTACAACGA CCGTCTGT 


3712 


323 


GACGGGCC G CUCAGCCA 


1385 


TGGCTGAG GGCTAGCTACAACGA GGCCCGTC 


3713 


328 


GCCGCUCA G CCAACCGG 


1386 


CCGGTTGG GGCTAGCTACAACGA TGAGCGGC 


3714 


332 


CUCAGCCA A CCGGGGUG 


1387 


CACCCCGG GGCTAGCTACAACGA TGGCTGAG 


3715 


338 


CAACCGGG G UGGGGCGG 


1388 


CCGCCCCA GGCTAGCTACAACGA CCCGGTTG 


3716 


343 


GGGGUGGG G CGGGGCCC 


1389 


GGGCCCCG GGCTAGCTACAACGA CCCACCCC 


3717 


348


CTGCAGCG GGCTAGCTACAACGA GCCTACCT


3981 


1477 


GUAGGCAC G CUGCAGUC


1654


GACTGCAG GGCTAGCTACAACGA GTGCCTAC


3982


1480


GGCACGCU G CAGUCCUU


1655


AAGGACTG GGCTAGCTACAACGA AGCGTGCC


3983


1483


ACGCUGCA G UCCUUGCU


1656


AGCAAGGA GGCTAGCTACAACGA TGCAGCGT


3984


1489


CAGUCCUU G CUGCCUGG


1657


CCAGGCAG GGCTAGCTACAACGA AAGGACTG


3985


1492


UCCUUGCU G CCUGGCGU


1658


ACGCCAGG GGCTAGCTACAACGA AGCAAGGA


3986 


1497 


PPT TPPPT TP P PPTTTTPPPP 
vjv_, vjvjjv_,v_.\j\j VJ L.uUUuuvAj 


1659 


p/~»/-i/^ A A PP PPPrpAPPrpA PA A PP A PHPPPl\rT> 

bbbbAAbb bbb 1 Abb 1 AbAAbbA CAGbCAbb. 


3987 


1499 


IJPPPTTPPP P TTT TPPPPPP 

UOvUUUOU w U vJOVjVjVJbb 


1660 


P/~» P P P P A A PpprpAPPT'A PA A PP A /"'OPAPPPA 

bbbbbbAA GbCXAbCXACAACGA GCCAGGCA 


3988 


1505 


pppttttppp P rrpafjnnB 

OV-Vjuuuuu vj V_.v_.v ttvjrVjrtji 1 -. 


1661 


TPPPTPPP PPPTPAPprp A PA A PP A /-tPPA A PPP 

X C Cb. X bbb bbb, X Abb X AbAAbbA CCbAACGC 


3989 


1513 


PPPPAPPP A PPPPTTPTTP 


1662 


PAPAPPPP pp PT» A P PT 1 A P A A PP A /~ t /~*f~*T'f~i/~ , /~'/~* 

bAbAb bbb bbb X Abb X AbAAbbA bbb Xbbbb 


3990 


1516 


PAPPPAPP P PTTPT TPPPT T 


1663 


A PPP A PAP PpprpAPPT'A PA A PP A f~*t~"T'/~*f*i~VT l /~i 

Abb bAbAb bbb X AbbXAbAAbGA GGlbCbxG 


3991 


1519 


PPAPPPPTT P T TPPPT TTTT TP 

VJVJ.rt.V_.V_.VJV_.vJ VJ UVjV3UUUUu 


1664 


PA A A PPP A /^»P PT« A P P«T» A P A A PP A a PPPPTPP 

bAAAbbbA GGb X Abb X AbAAbbA AbbbGTbb 


3992 


1523 


PPPTTPTTPP P TTTTTTPPPPTT 

V^UUUUUUvJ VJ VJ V_* VJVJV—V_,V_. (J 


1665 


A PPP P A A A pppfpTippm* PA A PP A PP A r^T^r^tT* 

AbbbbAAA bbb X Abb X AbAAbbA bbAbAbbb 


3993 


1527 


PT TPPPT TT TTT P PPPTTTTPAP 

VJ VJ VJVJVJ VJ VJ VJ V— J LUUUUwiU 


1666 


prvn a ppp nr* pt a p pt a p a a pp a aaapppap 
b XbAAbbb bbb X Abb X AbAAbbA AAAbbbAb 


3994 


1536


CCCUUCAG A UGGCCCUG


1667


CAGGGCCA GGCTAGCTACAACGA CTGAAGGG


3995


1539


UUCAGAUG G CCCUGCCA


1668


TGGCAGGG GGCTAGCTACAACGA CATCTGAA


3996


1544


AUGGCCCU G CCAGCAGC


1669


GCTGCTGG GGCTAGCTACAACGA AGGGCCAT


3997


1548


CCCUGCCA G CAGCUGCC


1670


GGCAGCTG GGCTAGCTACAACGA TGGCAGGG


3998


1551


UGCCAGCA G CUGCCCUG


1671


CAGGGCAG GGCTAGCTACAACGA TGCTGGCA


3999


1554


CAGCAGCU G CCCUGUGG


1672


CCACAGGG GGCTAGCTACAACGA AGCTGCTG


4000


1559


GCUGCCCU G UGGGGCCU


1673


AGGCCCCA GGCTAGCTACAACGA AGGGCAGC 


4001 


1564 


CCUGUGGG G CCUGGGGC 


1674 


GCCCCAGG GGCTAGCTACAACGA CCCACAGG 


4002 


1571 


GGCCUGGG G CUGGGCCU 


1675 


AGGCCCAG GGCTAGCTACAACGA CCCAGGCC 


4003 


1576 


GGGGCUGG G CCUGGGCC 


1676 


GGCCCAGG GGCTAGCTACAACGA CCAGCCCC 


4004 


1582 


GGGCCUGG G CCUGGCUG 


1677 


CAGCCAGG GGCTAGCTACAACGA CCAGGCCC 


4005 


1587 


UGGGCCUG G CUGAGCAG 


1678 


CTGCTCAG GGCTAGCTACAACGA CAGGCCCA 


4006 


1592 


CUGGCUGA G CAGGGCCC 


1679 


GGGCCCTG GGCTAGCTACAACGA TCAGCCAG 


4007 


1597 


UGAGCAGG G CCCUCCUU 


1680 


AAGGAGGG GGCTAGCTACAACGA CCTGCTCA 


4008 


1607 


CCUCCUUG G CAGGUGGG 


1681 


CCCACCTG GGCTAGCTACAACGA CAAGGAGG 


4009


1611


CUUGGCAG G UGGGGCAG


1682


CTGCCCCA GGCTAGCTACAACGA CTGCCAAG


4010


1616


CAGGUGGG G CAGGAGAC


1683


GTCTCCTG GGCTAGCTACAACGA CCCACCTG


4011


1623


GGCAGGAG A CCCUGUAG


1684


CTACAGGG GGCTAGCTACAACGA CTCCTGCC


4012


1628


GAGACCCU G UAGGAGGA


1685


TCCTCCTA GGCTAGCTACAACGA AGGGTCTC


4013


1636


GUAGGAGG A CCCCGGGC


1686


GCCCGGGG GGCTAGCTACAACGA CCTCCTAC


4014


1643


GACCCCGG G CCGCAGGC


1687


GCCTGCGG GGCTAGCTACAACGA CCGGGGTC


4015


1646


CCCGGGCC G CAGGCCCC


1688


GGGGCCTG GGCTAGCTACAACGA GGCCCGGG


4016


1650


GGCCGCAG G CCCCUGAG


1689


CTCAGGGG GGCTAGCTACAACGA CTGCGGCC


4017


1661


CCUGAGGA G CGAUGACG


1690


CGTCATCG GGCTAGCTACAACGA TCCTCAGG


4018


1664


GAGGAGCG A UGACGGAA


1691


TTCCGTCA GGCTAGCTACAACGA CGCTCCTC


4019


1667


GAGCGAUG A CGGAAUAU


1692


ATATTCCG GGCTAGCTACAACGA CATCGCTC


4020


1672


AUGACGGA A UAUAAGCU


1693


AGCTTATA GGCTAGCTACAACGA TCCGTCAT


4021


1674


GACGGAAU A UAAGCUGG


1694


CCAGCTTA GGCTAGCTACAACGA ATTCCGTC


4022


1678


GAAUAUAA G CUGGUGGU


1695


ACCACCAG GGCTAGCTACAACGA TTATATTC


4023


1682


AUAAGCUG G UGGUGGUG


1696


CACCACCA GGCTAGCTACAACGA CAGCTTAT


4024


1685


AGCUGGUG G UGGUGGGC


1697


GCCCACCA GGCTAGCTACAACGA CACCAGCT


4025


1688


UGGUGGUG G UGGGCGCC


1698


GGCGCCCA GGCTAGCTACAACGA CACCACCA


4026


1692


GGUGGUGG G CGCCGGCG


1699


CGCCGGCG GGCTAGCTACAACGA CCACCACC


4027


1694


UGGUGGGC G CCGGCGGU


1700


ACCGCCGG GGCTAGCTACAACGA GCCCACCA


4028


1698


GGGCGCCG G CGGUGUGG


1701


CCACACCG GGCTAGCTACAACGA CGGCGCCC


4029


1701


CGCCGGCG G UGUGGGCA


1702


TGCCCACA GGCTAGCTACAACGA CGCCGGCG


4030


1703


CCGGCGGU G UGGGCAAG


1703


CTTGCCCA GGCTAGCTACAACGA ACCGCCGG


4031


1707


CGGUGUGG G CAAGAGUG


1704


CACTCTTG GGCTAGCTACAACGA CCACACCG


4032


1713


GGGCAAGA G UGCGCUGA


1705


TCAGCGCA GGCTAGCTACAACGA TCTTGCCC


4033


1715


GCAAGAGU G CGCUGACC


1706


GGTCAGCG GGCTAGCTACAACGA ACTCTTGC


4034


1717


AAGAGUGC G CUGACCAU


1707


ATGGTCAG GGCTAGCTACAACGA GCACTCTT


4035


1721


GUGCGCUG A CCAUCCAG


1708


CTGGATGG GGCTAGCTACAACGA CAGCGCAC


4036


1724


CGCUGACC A UCCAGCUG


1709


CAGCTGGA GGCTAGCTACAACGA GGTCAGCG


4037


1729


ACCAUCCA G CUGAUCCA


1710


TGGATCAG GGCTAGCTACAACGA TGGATGGT


4038


1733


UCCAGCUG A UCCAGAAC


1711


GTTCTGGA GGCTAGCTACAACGA CAGCTGGA


4039


1740


GAUCCAGA A CCAUUUUG


1712


CAAAATGG GGCTAGCTACAACGA TCTGGATC


4040


1743


CCAGAACC A UUUUGUGG


1713


CCACAAAA GGCTAGCTACAACGA GGTTCTGG


4041


1748


ACCAUUUU G UGGACGAA


1714


TTCGTCCA GGCTAGCTACAACGA AAAATGGT


4042


1752


UUUUGUGG A CGAAUACG


1715


CGTATTCG GGCTAGCTACAACGA CCACAAAA


4043


1756


GUGGACGA A UACGACCC


1716


GGGTCGTA GGCTAGCTACAACGA TCGTCCAC


4044


1758


GGACGAAU A CGACCCCA


1717


TGGGGTCG GGCTAGCTACAACGA ATTCGTCC


4045


1761


CGAAUACG A CCCCACUA


1718


TAGTGGGG GGCTAGCTACAACGA CGTATTCG


4046


1766


ACGACCCC A CUAUAGAG


1719


CTCTATAG GGCTAGCTACAACGA GGGGTCGT


4047


1769


ACCCCACU A UAGAGGAU


1720


ATCCTCTA GGCTAGCTACAACGA AGTGGGGT


4048


1776


UAUAGAGG A UUCCUACC


1721


GGTAGGAA GGCTAGCTACAACGA CCTCTATA


4049


1782


GGAUUCCU A CCGGAAGC


1722


GCTTCCGG GGCTAGCTACAACGA AGGAATCC


4050


1789


UACCGGAA G CAGGUGGU


1723


ACCACCTG GGCTAGCTACAACGA TTCCGGTA


4051


1793


GGAAGCAG G UGGUCAUU


1724


AATGACCA GGCTAGCTACAACGA CTGCTTCC


4052


1796


AGCAGGUG G UCAUUGAU


1725


ATCAATGA GGCTAGCTACAACGA CACCTGCT


4053


1799


AGGUGGUC A UUGAUGGG


1726


CCCATCAA GGCTAGCTACAACGA GACCACCT


4054


1803


GGUCAUUG A UGGGGAGA


1727


TCTCCCCA GGCTAGCTACAACGA CAATGACC


4055


1811


AUGGGGAG A CGUGCCUG


1728


CAGGCACG GGCTAGCTACAACGA CTCCCCAT


4056


1813


GGGGAGAC G UGCCUGUU


1729


AACAGGCA GGCTAGCTACAACGA GTCTCCCC


4057


1815


GGAGACGU G CCUGUUGG


1730


CCAACAGG GGCTAGCTACAACGA ACGTCTCC


4058


1819


ACGUGCCU G UUGGACAU


1731


ATGTCCAA GGCTAGCTACAACGA AGGCACGT


4059


1824


CCUGUUGG A CAUCCUGG


1732


CCAGGATG GGCTAGCTACAACGA CCAACAGG


4060


1826


UGUUGGAC A UCCUGGAU


1733


ATCCAGGA GGCTAGCTACAACGA GTCCAACA


4061


1833


CAUCCUGG A UACCGCCG


1734 


CGGCGGTA GGCTAGCTACAACGA CCAGGATG 


4062 


1835 


UCCUGGAU A CCGCCGGC 


1735 


GCCGGCGG GGCTAGCTACAACGA ATCCAGGA 


4063 


1838 


UGGAUACC G CCGGCCAG 


1736 


CTGGCCGG GGCTAGCTACAACGA GGTATCCA 


4064 


1842 


UACCGCCG G CCAGGAGG 


1737 


CCTCCTGG GGCTAGCTACAACGA CGGCGGTA 


4065 


1852 


CAGGAGGA G UACAGCGC 


1738 


GCGCTGTA GGCTAGCTACAACGA TCCTCCTG 


4066 


1854 


GGAGGAGU A CAGCGCCA 


1739 


TGGCGCTG GGCTAGCTACAACGA ACTCCTCC 


4067 


1857 


GGAGUACA G CGCCAUGC 


1740 


GCATGGCG GGCTAGCTACAACGA TGTACTCC 


4068


1859 


AGUACAGC G CCAUGCGG 


1741 


CCGCATGG GGCTAGCTACAACGA GCTGTACT


4069


1862 


ACAGCGCC A UGCGGGAC 


1742 


GTCCCGCA GGCTAGCTACAACGA GGCGCTGT 


4070


1864 


AGCGCCAU G CGGGACCA 


1743 


TGGTCCCG GGCTAGCTACAACGA ATGGCGCT 


4071


1869 


CAUGCGGG A CCAGUACA 


1744 


TGTACTGG GGCTAGCTACAACGA CCCGCATG


4072


1873 


CGGGACCA G UACAUGCG 


1745 


CGCATGTA GGCTAGCTACAACGA TGGTCCCG


4073


1875


GGACCAGU A CAUGCGCA 


1746 


TGCGCATG GGCTAGCTACAACGA ACTGGTCC 


4074


1877 


ACCAGUAC A UGCGCACC 


1747 


GGTGCGCA GGCTAGCTACAACGA GTACTGGT 


4075


1879 


CAGUACAU G CGCACCGG 


1748 


CCGGTGCG GGCTAGCTACAACGA ATGTACTG 


4076


1881 


GUACAUGC G CACCGGGG 


1749 


CCCCGGTG GGCTAGCTACAACGA GCATGTAC 


4077


1883 


ACAUGCGC A CCGGGGAG 


1750 


CTCCCCGG GGCTAGCTACAACGA GCGCATGT 


4078


1893 


CGGGGAGG G CUUCCUGU 


1751 


ACAGGAAG GGCTAGCTACAACGA CCTCCCCG 


4079


1900 


GGCUUCCU G UGUGUGUU 


1752 


AACACACA GGCTAGCTACAACGA AGGAAGCC 


4080


1902 


CUUCCUGU G UGUGUUUG 


1753 


CAAACACA GGCTAGCTACAACGA ACAGGAAG 


4081


1904 


UCCUGUGU G UGUUUGCC 


1754 


GGCAAACA GGCTAGCTACAACGA ACACAGGA 


4082


1906 


CUGUGUGU G UUUGCCAU 


1755 


ATGGCAAA GGCTAGCTACAACGA ACACACAG 


4083


1910 


GUGUGUUU G CCAUCAAC 


1756 


GTTGATGG GGCTAGCTACAACGA AAACACAC 


4084


1913 


UGUUUGCC A UCAACAAC 


1757 


GTTGTTGA GGCTAGCTACAACGA GGCAAACA


4085


1917 


UGCCAUCA A CAACACCA 


1758 


TGGTGTTG GGCTAGCTACAACGA TGATGGCA 


4086


1920 


CAUCAACA A CACCAAGU 


1759 


ACTTGGTG GGCTAGCTACAACGA TGTTGATG 


4087


1922 


UCAACAAC A CCAAGUCU 


1760 


AGACTTGG GGCTAGCTACAACGA GTTGTTGA 


4088


1927 


AACACCAA G UCUUUUGA 


1761 


TCAAAAGA GGCTAGCTACAACGA TTGGTGTT 


4089


1938 


UUUUGAGG A CAUCCACC 


1762 


GGTGGATG GGCTAGCTACAACGA CCTCAAAA


4090


1940 


UUGAGGAC A UCCACCAG 


1763 


CTGGTGGA GGCTAGCTACAACGA GTCCTCAA


4091


1944 


GGACAUCC A CCAGUACA 


1764 


TGTACTGG GGCTAGCTACAACGA GGATGTCC


4092


1948 


AUCCACCA G UACAGGGA 


1765 


TCCCTGTA GGCTAGCTACAACGA TGGTGGAT


4093


1950 


CCACCAGU A CAGGGAGC 


1766 


GCTCCCTG GGCTAGCTACAACGA ACTGGTGG


4094


1957 


UACAGGGA G CAGAUCAA 


1767 


TTGATCTG GGCTAGCTACAACGA TCCCTGTA 


4095


1961 


GGGAGCAG A UCAAACGG 


1768 


CCGTTTGA GGCTAGCTACAACGA CTGCTCCC 


4096 


1966 


CAGAUCAA A CGGGUGAA 


1769 


TTCACCCG GGCTAGCTACAACGA TTGATCTG 


4097 


1970 


UCAAACGG G UGAAGGAC 


1770 


GTCCTTCA GGCTAGCTACAACGA CCGTTTGA 


4098


1977 


GGUGAAGG A CUCGGAUG 


1771 


CATCCGAG GGCTAGCTACAACGA CCTTCACC


4099


1983 


GGACUCGG A UGACGUGC 


1772 


GCACGTCA GGCTAGCTACAACGA CCGAGTCC 


4100


1986 


CUCGGAUG A CGUGCCCA 


1773 


TGGGCACG GGCTAGCTACAACGA CATCCGAG 


4101


1988 


CGGAUGAC G UGCCCAUG 


1774 


CATGGGCA GGCTAGCTACAACGA GTCATCCG 


4102


1990 


GAUGACGU G CCCAUGGU 


1775 


ACCATGGG GGCTAGCTACAACGA ACGTCATC 


4103


1994 


ACGUGCCC A UGGUGCUG 


1776 


CAGCACCA GGCTAGCTACAACGA GGGCACGT 


4104


1997 


UGCCCAUG G UGCUGGUG 


1777 


CACCAGCA GGCTAGCTACAACGA CATGGGCA 


4105 


1999 


CCCAUGGU G CUGGUGGG 


1778 


CCCACCAG GGCTAGCTACAACGA ACCATGGG 


4106 


2003 


UGGUGCUG G UGGGGAAC 


1779 


GTTCCCCA GGCTAGCTACAACGA CAGCACCA 


4107 


2010 


GGUGGGGA A CAAGUGUG 


1780 


CACACTTG GGCTAGCTACAACGA TCCCCACC 


4108 


2014 


GGGAACAA G UGUGACCU 


1781 


AGGTCACA GGCTAGCTACAACGA TTGTTCCC 


4109 


2016 


GAACAAGU G UGACCUGG 


1782 


CCAGGTCA GGCTAGCTACAACGA ACTTGTTC 


4110 


2019 


CAAGUGUG A CCUGGCUG 


1783 


CAGCCAGG GGCTAGCTACAACGA CACACTTG 


4111 


2024 


GUGACCUG G CUGCACGC 


1784 


GCGTGCAG GGCTAGCTACAACGA CAGGTCAC 


4112 


2027 


ACCUGGCU G CACGCACU 


1785 


AGTGCGTG GGCTAGCTACAACGA AGCCAGGT 


4113


2029


CUGGCUGC A CGCACUGU


1786


ACAGTGCG GGCTAGCTACAACGA GCAGCCAG


4114


2031


GGCUGCAC G CACUGUGG


1787


CCACAGTG GGCTAGCTACAACGA GTGCAGCC


4115


2033


CUGCACGC A CUGUGGAA


1788


TTCCACAG GGCTAGCTACAACGA GCGTGCAG


4116


2036


CACGCACU G UGGAAUCU


1789


AGATTCCA GGCTAGCTACAACGA AGTGCGTG


4117


2041


ACUGUGGA A UCUCGGCA


1790


TGCCGAGA GGCTAGCTACAACGA TCCACAGT


4118


2047


GAAUCUCG G CAGGCUCA


1791


TGAGCCTG GGCTAGCTACAACGA CGAGATTC


4119


2051


CUCGGCAG G CUCAGGAC


1792


GTCCTGAG GGCTAGCTACAACGA CTGCCGAG


4120


2058


GGCUCAGG A CCUCGCCC


1793


GGGCGAGG GGCTAGCTACAACGA CCTGAGCC


4121


2063


AGGACCUC G CCCGAAGC


1794


GCTTCGGG GGCTAGCTACAACGA GAGGTCCT


4122


2070


CGCCCGAA G CUACGGCA


1795


TGCCGTAG GGCTAGCTACAACGA TTCGGGCG


4123


2073


CCGAAGCU A CGGCAUCC


1796


GGATGCCG GGCTAGCTACAACGA AGCTTCGG


4124


2076


AAGCUACG G CAUCCCCU


1797


AGGGGATG GGCTAGCTACAACGA CGTAGCTT


4125


2078


GCUACGGC A UCCCCUAC


1798


GTAGGGGA GGCTAGCTACAACGA GCCGTAGC


4126


2085


CAUCCCCU A CAUCGAGA


1799


TCTCGATG GGCTAGCTACAACGA AGGGGATG


4127


2087


UCCCCUAC A UCGAGACC


1800


GGTCTCGA GGCTAGCTACAACGA GTAGGGGA


4128


2093


ACAUCGAG A CCUCGGCC


1801


GGCCGAGG GGCTAGCTACAACGA CTCGATGT


4129


2099


AGACCUCG G CCAAGACC


1802


GGTCTTGG GGCTAGCTACAACGA CGAGGTCT


4130


2105


CGGCCAAG A CCCGGCAG


1803


CTGCCGGG GGCTAGCTACAACGA CTTGGCCG


4131


2110


AAGACCCG G CAGGGAGU


1804


ACTCCCTG GGCTAGCTACAACGA CGGGTCTT


4132


2117


GGCAGGGA G UGGAGGAU


1805


ATCCTCCA GGCTAGCTACAACGA TCCCTGCC


4133


2124


AGUGGAGG A UGCCUUCU


1806


AGAAGGCA GGCTAGCTACAACGA CCTCCACT


4134


2126


UGGAGGAU G CCUUCUAC


1807


GTAGAAGG GGCTAGCTACAACGA ATCCTCCA


4135


2133


UGCCUUCU A CACGUUGG


1808


CCAACGTG GGCTAGCTACAACGA AGAAGGCA


4136


2135


CCUUCUAC A CGUUGGUG


1809


CACCAACG GGCTAGCTACAACGA GTAGAAGG


4137


2137


UUCUACAC G UUGGUGCG


1810


CGCACCAA GGCTAGCTACAACGA GTGTAGAA


4138


2141


ACACGUUG G UGCGUGAG


1811


CTCACGCA GGCTAGCTACAACGA CAACGTGT


4139


2143


ACGUUGGU G CGUGAGAU


1812


ATCTCACG GGCTAGCTACAACGA ACCAACGT


4140


2145


GUUGGUGC G UGAGAUCC


1813


GGATCTCA GGCTAGCTACAACGA GCACCAAC


4141


2150


UGCGUGAG A UCCGGCAG


1814


CTGCCGGA GGCTAGCTACAACGA CTCACGCA


4142


2155


GAGAUCCG G CAGCACAA


1815


TTGTGCTG GGCTAGCTACAACGA CGGATCTC


4143


2158


AUCCGGCA G CACAAGCU


1816


AGCTTGTG GGCTAGCTACAACGA TGCCGGAT


4144


2160


CCGGCAGC A CAAGCUGC


1817


GCAGCTTG GGCTAGCTACAACGA GCTGCCGG


4145


2164


CAGCACAA G CUGCGGAA


1818


TTCCGCAG GGCTAGCTACAACGA TTGTGCTG


4146


2167


CACAAGCU G CGGAAGCU


1819


AGCTTCCG GGCTAGCTACAACGA AGCTTGTG


4147


2173


CUGCGGAA G CUGAACCC


1820


GGGTTCAG GGCTAGCTACAACGA TTCCGCAG


4148


2178


GAAGCUGA A CCCUCCUG


1821


CAGGAGGG GGCTAGCTACAACGA TCAGCTTC


4149


2187


CCCUCCUG A UGAGAGUG


1822


CACTCTCA GGCTAGCTACAACGA CAGGAGGG


4150


2193


UGAUGAGA G UGGCCCCG


1823


CGGGGCCA GGCTAGCTACAACGA TCTCATCA


4151


2196


UGAGAGUG G CCCCGGCU


1824


AGCCGGGG GGCTAGCTACAACGA CACTCTCA


4152


2202


UGGCCCCG G CUGCAUGA


1825


TCATGCAG GGCTAGCTACAACGA CGGGGCCA


4153


2205


CCCCGGCU G CAUGAGCU


1826


AGCTCATG GGCTAGCTACAACGA AGCCGGGG


4154


2207


CCGGCUGC A UGAGCUGC


1827


GCAGCTCA GGCTAGCTACAACGA GCAGCCGG


4155


2211


CUGCAUGA G CUGCAAGU


1828


ACTTGCAG GGCTAGCTACAACGA TCATGCAG


4156


2214 


CAUGAGCU G CAAGUGUG 


1829 


CACACTTG GGCTAGCTACAACGA AGCTCATG 


4157 


2218 


AGCUGCAA G UGUGUGCU 


1830 


AGCACACA GGCTAGCTACAACGA TTGCAGCT 


4158 


2220 


CUGCAAGU G UGUGCUCU 


1831 


AGAGCACA GGCTAGCTACAACGA ACTTGCAG 


4159 


2222 


GCAAGUGU G UGCUCUCC 


1832 


GGAGAGCA GGCTAGCTACAACGA ACACTTGC


4160 


2224 


AAGUGUGU G CUCUCCUG 


1833 


CAGGAGAG GGCTAGCTACAACGA ACACACTT 


4161 


2233 


CUCUCCUG A CGCAGGUG 


1834 


CACCTGCG GGCTAGCTACAACGA CAGGAGAG 


4162 


2235 


CUCCUGAC G CAGGUGAG 


1835 


CTCACCTG GGCTAGCTACAACGA GTCAGGAG 


4163 


2239 


UGACGCAG G UGAGGGGG 


1836 


CCCCCTCA GGCTAGCTACAACGA CTGCGTCA 


4164 


2248 


UGAGGGGG A CUCCCAGG 


1837 


CCTGGGAG GGCTAGCTACAACGA CCCCCTCA 


4165


2257


CUCCCAGG G CGGCCGCC 


1838 


GGCGGCCG GGCTAGCTACAACGA CCTGGGAG


4166 


2260 


CCAGGGCG G CCGCCACG 


1839


CGTGGCGG GGCTAGCTACAACGA CGCCCTGG 


4167 


2263 


GGGCGGCC G CCACGCCC 


1840 


GGGCGTGG GGCTAGCTACAACGA GGCCGCCC


4168 


2266 


CGGCCGCC A CGCCCACC 


1841


GGTGGGCG GGCTAGCTACAACGA GGCGGCCG 


4169 


2268 


GCCGCCAC G CCCACCGG 


1842


CCGGTGGG GGCTAGCTACAACGA GTGGCGGC 


4170 


2272


CCACGCCC A CCGGAUGA 


1843 


TCATCCGG GGCTAGCTACAACGA GGGCGTGG 


4171 


2277 


CCCACCGG A UGACCCCG 


1844 


CGGGGTCA GGCTAGCTACAACGA CCGGTGGG


4172 


2280 


ACCGGAUG A CCCCGGCU 


1845 


AGCCGGGG GGCTAGCTACAACGA CATCCGGT


4173 


2286 


UGACCCCG G CUCCCCGC 


1846 


GCGGGGAG GGCTAGCTACAACGA CGGGGTCA


4174 


2293 


GGCUCCCC G CCCCUGCC 


1847 


GGCAGGGG GGCTAGCTACAACGA GGGGAGCC 


4175 


2299 


CCGCCCCU G CCGGUCUC 


1848 


GAGACCGG GGCTAGCTACAACGA AGGGGCGG


4176 


2303 


CCCUGCCG G UCUCCUGG 


1849 


CCAGGAGA GGCTAGCTACAACGA CGGCAGGG


4177 


2311 


GUCUCCUG G CCUGCGGU 


1850 


ACCGCAGG GGCTAGCTACAACGA CAGGAGAC 


4178 


2315


CCUGGCCU G CGGUCAGC 


1851 


GCTGACCG GGCTAGCTACAACGA AGGCCAGG


4179 


2318


GGCCUGCG G UCAGCAGC 


1852 


GCTGCTGA GGCTAGCTACAACGA CGCAGGCC


4180 


2322


UGCGGUCA G CAGCCUCC 


1853 


GGAGGCTG GGCTAGCTACAACGA TGACCGCA


4181 


2325 


GGUCAGCA G CCUCCCUU 


1854 


AAGGGAGG GGCTAGCTACAACGA TGCTGACC


4182 


2334 


CCUCCCUU G UGCCCCGC 


1855 


GCGGGGCA GGCTAGCTACAACGA AAGGGAGG


4183 


2336 


UCCCUUGU G CCCCGCCC 


1856 


GGGCGGGG GGCTAGCTACAACGA ACAAGGGA


4184 


2341 


UGUGCCCC G CCCAGCAC 


1857 


GTGCTGGG GGCTAGCTACAACGA GGGGCACA


4185 


2346 


CCCGCCCA G CACAAGCU 


1858


AGCTTGTG GGCTAGCTACAACGA TGGGCGGG


4186 


2348 


CGCCCAGC A CAAGCUCA 


1859 


TGAGCTTG GGCTAGCTACAACGA GCTGGGCG


4187 


2352 


CAGCACAA G CUCAGGAC 


1860 


GTCCTGAG GGCTAGCTACAACGA TTGTGCTG


4188 


2359 


AGCUCAGG A CAUGGAGG 


1861 


CCTCCATG GGCTAGCTACAACGA CCTGAGCT


4189 


2361 


CUCAGGAC A UGGAGGUG 


1862 


CACCTCCA GGCTAGCTACAACGA GTCCTGAG


4190 


2367 


ACAUGGAG G UGCCGGAU 


1863 


ATCCGGCA GGCTAGCTACAACGA CTCCATGT


4191 


2369 


AUGGAGGU G CCGGAUGC 


1864 


GCATCCGG GGCTAGCTACAACGA ACCTCCAT 


4192 


2374 


GGUGCCGG A UGCAGGAA 


1865 


TTCCTGCA GGCTAGCTACAACGA CCGGCACC


4193 


2376 


UGCCGGAU G CAGGAAGG 


1866


CCTTCCTG GGCTAGCTACAACGA ATCCGGCA


4194 


2387 


GGAAGGAG G UGCAGACG 


1867 


CGTCTGCA GGCTAGCTACAACGA CTCCTTCC 


4195 


2389 


AAGGAGGU G CAGACGGA 


1868 


TCCGTCTG GGCTAGCTACAACGA ACCTCCTT


4196 


2393 


AGGUGCAG A CGGAAGGA 


1869 


TCCTTCCG GGCTAGCTACAACGA CTGCACCT


4197 


2415 


AAGGAAGG A CGGAAGCA


1870


TGCTTCCG GGCTAGCTACAACGA CCTTCCTT 


4198 


2421 


GGACGGAA G CAAGGAAG 


1871 


CTTCCTTG GGCTAGCTACAACGA TTCCGTCC


4199 


2439


AAGGAAGG G CUGCUGGA 


1872 


TCCAGCAG GGCTAGCTACAACGA CCTTCCTT 


4200 


2442 


GAAGGGCU G CUGGAGCC 


1873 


GGCTCCAG GGCTAGCTACAACGA AGCCCTTC


4201 


2448 


CUGCUGGA G CCCAGUCA 


1874


TGACTGGG GGCTAGCTACAACGA TCCAGCAG 


4202 


2453 


GGAGCCCA G UCACCCCG 


1875


CGGGGTGA GGCTAGCTACAACGA TGGGCTCC 


4203 


2456 


GCCCAGUC A CCCCGGGA 


1876 


TCCCGGGG GGCTAGCTACAACGA GACTGGGC 


4204 


2464 


ACCCCGGG A CCGUGGGC 


1877 


GCCCACGG GGCTAGCTACAACGA CCCGGGGT


4205 


2467 


CCGGGACC G UGGGCCGA 


1878 


TCGGCCCA GGCTAGCTACAACGA GGTCCCGG


4206 


2471 


GACCGUGG G CCGAGGUG 


1879 


CACCTCGG GGCTAGCTACAACGA CCACGGTC 


4207 


2477 


GGGCCGAG G UGACUGCA 


1880 


TGCAGTCA GGCTAGCTACAACGA CTCGGCCC


4208 


2480 


CCGAGGUG A CUGCAGAC 


1881 


GTCTGCAG GGCTAGCTACAACGA CACCTCGG


4209


2483 


AGGUGACU G CAGACCCU 


1882 


AGGGTCTG GGCTAGCTACAACGA AGTCACCT 


4210 


2487 


GACUGCAG A CCCUCCCA 


1883 


TGGGAGGG GGCTAGCTACAACGA CTGCAGTC 


4211 


2501 


CCAGGGAG G CUGUGCAC 


1884 


GTGCACAG GGCTAGCTACAACGA CTCCCTGG 


4212 


2504 


GGGAGGCU G UGCACAGA 


1885 


TCTGTGCA GGCTAGCTACAACGA AGCCTCCC 


4213 


2506 


GAGGCUGU G CACAGACU 


1886 


AGTCTGTG GGCTAGCTACAACGA ACAGCCTC 


4214 


2508 


GGCUGUGC A CAGACUGU 


1887 


ACAGTCTG GGCTAGCTACAACGA GCACAGCC 


4215 


2512 


GUGCACAG A CUGUCUUG 


1888 


CAAGACAG GGCTAGCTACAACGA CTGTGCAC 


4216 


2515 


CACAGACU G UCUUGAAC 


1889 


GTTCAAGA GGCTAGCTACAACGA AGTCTGTG 


4217


2522


UGUCUUGA A CAUCCCAA


1890


TTGGGATG GGCTAGCTACAACGA TCAAGACA


4218


2524


UCUUGAAC A UCCCAAAU


1891


ATTTGGGA GGCTAGCTACAACGA GTTCAAGA


4219


2531


CAUCCCAA A UGCCACCG


1892


CGGTGGCA GGCTAGCTACAACGA TTGGGATG


4220


2533


UCCCAAAU G CCACCGGA


1893


TCCGGTGG GGCTAGCTACAACGA ATTTGGGA


4221


2536


CAAAUGCC A CCGGAACC


1894


GGTTCCGG GGCTAGCTACAACGA GGCATTTG


4222


2542


CCACCGGA A CCCCAGCC


1895


GGCTGGGG GGCTAGCTACAACGA TCCGGTGG


4223


2548


GAACCCCA G CCCUUAGC


1896


GCTAAGGG GGCTAGCTACAACGA TGGGGTTC


4224


2555


AGCCCUUA G CUCCCCUC


1897


GAGGGGAG GGCTAGCTACAACGA TAAGGGCT


4225


2568


CCUCCCAG G CCUCUGUG


1898


CACAGAGG GGCTAGCTACAACGA CTGGGAGG


4226


2574


AGGCCUCU G UGGGCCCU


1899


AGGGCCCA GGCTAGCTACAACGA AGAGGCCT


4227


2578


CUCUGUGG G CCCUUGUC


1900


GACAAGGG GGCTAGCTACAACGA CCACAGAG


4228


2584


GGGCCCUU G UCGGGCAC


1901


GTGCCCGA GGCTAGCTACAACGA AAGGGCCC


4229


2589


CUUGUCGG G CACAGAUG


1902


CATCTGTG GGCTAGCTACAACGA CCGACAAG


4230


2591


UGUCGGGC A CAGAUGGG


1903


CCCATCTG GGCTAGCTACAACGA GCCCGACA


4231


2595


GGGCACAG A UGGGAUCA


1904


TGATCCCA GGCTAGCTACAACGA CTGTGCCC


4232


2600


CAGAUGGG A UCACAGUA


1905


TACTGTGA GGCTAGCTACAACGA CCCATCTG


4233


2603


AUGGGAUC A CAGUAAAU


1906


ATTTACTG GGCTAGCTACAACGA GATCCCAT


4234


2606


GGAUCACA G UAAAUUAU


1907


ATAATTTA GGCTAGCTACAACGA TGTGATCC


4235


2610


CACAGUAA A UUAUUGGA


1908


TCCAATAA GGCTAGCTACAACGA TTACTGTG


4236


2613


AGUAAAUU A UUGGAUGG


1909


CCATCCAA GGCTAGCTACAACGA AATTTACT


4237


2618


AUUAUUGG A UGGUCUUG


1910


CAAGACCA GGCTAGCTACAACGA CCAATAAT


4238


2621


AUUGGAUG G UCUUGAUC


1911


GATCAAGA GGCTAGCTACAACGA CATCCAAT


4239


2627


UGGUCUUG A UCUUGGUU


1912


AACCAAGA GGCTAGCTACAACGA CAAGACCA


4240


2633


UGAUCUUG G UUUUCGGC


1913


GCCGAAAA GGCTAGCTACAACGA CAAGATCA


4241


2640


GGUUUUCG G CUGAGGGU


1914


ACCCTCAG GGCTAGCTACAACGA CGAAAACC


4242


2647


GGCUGAGG G UGGGACAC


1915


GTGTCCCA GGCTAGCTACAACGA CCTCAGCC


4243


2652


AGGGUGGG A CACGGUGC


1916


GCACCGTG GGCTAGCTACAACGA CCCACCCT


4244


2654


GGUGGGAC A CGGUGCGC


1917


GCGCACCG GGCTAGCTACAACGA GTCCCACC


4245


2657


GGGACACG G UGCGCGUG


1918


CACGCGCA GGCTAGCTACAACGA CGTGTCCC


4246


2659


GACACGGU G CGCGUGUG


1919


CACACGCG GGCTAGCTACAACGA ACCGTGTC


4247


2661


CACGGUGC G CGUGUGGC


1920


GCCACACG GGCTAGCTACAACGA GCACCGTG


4248


2663


CGGUGCGC G UGUGGCCU


1921


AGGCCACA GGCTAGCTACAACGA GCGCACCG


4249


2665


GUGCGCGU G UGGCCUGG


1922


CCAGGCCA GGCTAGCTACAACGA ACGCGCAC


4250


2668


CGCGUGUG G CCUGGCAU


1923


ATGCCAGG GGCTAGCTACAACGA CACACGCG


4251


2673


GUGGCCUG G CAUGAGGU


1924


ACCTCATG GGCTAGCTACAACGA CAGGCCAC


4252


2675


GGCCUGGC A UGAGGUAU


1925


ATACCTCA GGCTAGCTACAACGA GCCAGGCC


4253


2680


GGCAUGAG G UAUGUCGG


1926


CCGACATA GGCTAGCTACAACGA CTCATGCC


4254


2682


CAUGAGGU A UGUCGGAA


1927


TTCCGACA GGCTAGCTACAACGA ACCTCATG


4255


2684


UGAGGUAU G UCGGAACC


1928


GGTTCCGA GGCTAGCTACAACGA ATACCTCA


4256


2690


AUGUCGGA A CCUCAGGC


1929


GCCTGAGG GGCTAGCTACAACGA TCCGACAT


4257


2697


AACCUCAG G CCUGUCCA


1930


TGGACAGG GGCTAGCTACAACGA CTGAGGTT


4258


2701


UCAGGCCU G UCCAGCCC


1931


GGGCTGGA GGCTAGCTACAACGA AGGCCTGA


4259


2706


CCUGUCCA G CCCUGGGC


1932


GCCCAGGG GGCTAGCTACAACGA TGGACAGG


4260


2713


AGCCCUGG G CUCUCCAU


1933


ATGGAGAG GGCTAGCTACAACGA CCAGGGCT


4261


2720 


GGCUCUCC A UAGCCUUU 


1934 


AAAGGCTA GGCTAGCTACAACGA GGAGAGCC 


4262 


2723 


UCUCCAUA G CCUUUGGG 


1935 


CCCAAAGG GGCTAGCTACAACGA TATGGAGA 


4263 


2740 


AGGGGGAG G UUGGGAGA 


1936 


TCTCCCAA GGCTAGCTACAACGA CTCCCCCT 


4264 


2750 


UGGGAGAG G CCGGUCAG 


1937 


CTGACCGG GGCTAGCTACAACGA CTCTCCCA 


4265 


2754 


AGAGGCCG G UCAGGGGU 


1938 


ACCCCTGA GGCTAGCTACAACGA CGGCCTCT


4266 


2761 


GGUCAGGG G UCUGGGCU 


1939 


AGCCCAGA GGCTAGCTACAACGA CCCTGACC 


4267 


2767 


GGGUCUGG G CUGUGGUG 


1940 


CACCACAG GGCTAGCTACAACGA CCAGACCC 


4268 


2770 


UCUGGGCU G UGGUGCUC 


1941 


GAGCACCA GGCTAGCTACAACGA AGCCCAGA 


4269


2773


GGGCUGUG G UGCUCUCU


1942


AGAGAGCA GGCTAGCTACAACGA CACAGCCC 


4270 


2775 


GCUGUGGU G CUCUCUCC 


1943


GGAGAGAG GGCTAGCTACAACGA ACCACAGC 


4271 


2788 


CUCCUCCC G CCUGCCCC 


1944


GGGGCAGG GGCTAGCTACAACGA GGGAGGAG 


4272 


2792 


UCCCGCCU G CCCCAGUG 


1945


CACTGGGG GGCTAGCTACAACGA AGGCGGGA 


4273 


2798 


CUGCCCCA G UGUCCACG 


1946


CGTGGACA GGCTAGCTACAACGA TGGGGCAG 


4274 


2800 


GCCCCAGU G UCCACGGC 


1947


GCCGTGGA GGCTAGCTACAACGA ACTGGGGC 


4275 


2804 


CAGUGUCC A CGGCUUCU


1948


AGAAGCCG GGCTAGCTACAACGA GGACACTG 


4276 


2807 


UGUCCACG G CUUCUGGC 


1949


GCCAGAAG GGCTAGCTACAACGA CGTGGACA 


4277 


2814 


GGCUUCUG G CAGAGAGC 


1950


GCTCTCTG GGCTAGCTACAACGA CAGAAGCC 


4278 


2821 


GGCAGAGA G CUCUGGAC 


1951


GTCCAGAG GGCTAGCTACAACGA TCTCTGCC 


4279 


2828 


AGCUCUGG A CAAGCAGG 


1952


CCTGCTTG GGCTAGCTACAACGA CCAGAGCT 


4280 


2832 


CUGGACAA G CAGGCAGA 


1953


TCTGCCTG GGCTAGCTACAACGA TTGTCCAG 


4281 


2836


ACAAGCAG G CAGAUCAU 


1954


ATGATCTG GGCTAGCTACAACGA CTGCTTGT 


4282 


2840 


GCAGGCAG A UCAUAAGG 




CCTTATGA GGCTAGCTACAACGA CTGCCTGC 


4283 


2843 


GGCAGAUC A UAAGGACA 


1956


TGTCCTTA GGCTAGCTACAACGA GATCTGCC 


4284 


2849


UCAUAAGG A CAGAGAGC 


1957


GCTCTCTG GGCTAGCTACAACGA CCTTATGA 


4285 


2856 


GACAGAGA G CUUACUGU 


1958


ACAGTAAG GGCTAGCTACAACGA TCTCTGTC 


4286 


2860 


GAGAGCUU A CUGUGCUU 


1959


AAGCACAG GGCTAGCTACAACGA AAGCTCTC 


4287 


2863 


AGCUUACU G UGCUUCUA 


1960


TAGAAGCA GGCTAGCTACAACGA AGTAAGCT 


4288 


2865 


CUUACUGU G CUUCUACC 




GGTAGAAG GGCTAGCTACAACGA ACAGTAAG 


4289 


2871 


GUGCUUCU A CCAACUAG 


1962 


CTAGTTGG GGCTAGCTACAACGA AGAAGCAC


4290 


2875 


UUCUACCA A CUAGGAGG 




CCTCCTAG GGCTAGCTACAACGA TGGTAGAA 


4291 


2884 


CUAGGAGG G CGUCCUGG 


1964


CCAGGACG GGCTAGCTACAACGA CCTCCTAG 


4292 


2886 


AGGAGGGC G UCCUGGUC 




GACCAGGA GGCTAGCTACAACGA GCCCTCCT


4293 


2892 


GCGUCCUG G UCCUCCAG 




CTGGAGGA GGCTAGCTACAACGA CAGGACGC 


4294 


2907 


AGAGGGAG G UGGUUUCA 


1967


TGAAACCA GGCTAGCTACAACGA CTCCCTCT 


4295 


2910 


GGGAGGUG G UUUCAGGG 


1968


CCCTGAAA GGCTAGCTACAACGA CACCTCCC 


4296 


2919 


UUUCAGGG G UUGGGGAU 


1969


ATCCCCAA GGCTAGCTACAACGA CCCTGAAA 


4297 


2926 


GGUUGGGG A UCUGUGCC 


1970


GGCACAGA GGCTAGCTACAACGA CCCCAACC


4298 


2930 


GGGGAUCU G UGCCGGUG 




CACCGGCA GGCTAGCTACAACGA AGATCCCC 


4299 


2932 


GGAUCUGU G CCGGUGGC 


1972


GCCACCGG GGCTAGCTACAACGA ACAGATCC 


4300 


2936 


CUGUGCCG G UGGCUCUG 


1973


CAGAGCCA GGCTAGCTACAACGA CGGCACAG 


4301 


2939 


UGCCGGUG G CUCUGGUC 


1974


GACCAGAG GGCTAGCTACAACGA CACCGGCA 


4302 


2945 


UGGCUCUG G UCUCUGCU 


1975


AGCAGAGA GGCTAGCTACAACGA CAGAGCCA 


4303 


2951 


UGGUCUCU G CUGGGAGC 


1976


GCTCCCAG GGCTAGCTACAACGA AGAGACCA 


4304 


2958 


UGCUGGGA G CCUUCUUG 


1977


CAAGAAGG GGCTAGCTACAACGA TCCCAGCA 


4305 


2967 


CCUUCUUG G CGGUGAGA 


1978


TCTCACCG GGCTAGCTACAACGA CAAGAAGG 


4306 


2970


UCUUGGCG G UGAGAGGC 


1979


GCCTCTCA GGCTAGCTACAACGA CGCCAAGA 


4307 


2977 


GGUGAGAG G CAUCACCU 


1980


AGGTGATG GGCTAGCTACAACGA CTCTCACC 


4308 


2979 


UGAGAGGC A UCACCUUU 


1981


AAAGGTGA GGCTAGCTACAACGA GCCTCTCA 


4309 


2982 


GAGGCAUC A CCUUUCCU 


1982


AGGAAAGG GGCTAGCTACAACGA GATGCCTC 


4310 


2992 


CUUUCCUG A CUUGCUCC 


1983


GGAGCAAG GGCTAGCTACAACGA CAGGAAAG 


4311 


2996 


CCUGACUU G CUCCCAGC 


1984


GCTGGGAG GGCTAGCTACAACGA AAGTCAGG 



4312 


3003 


UGCUCCCA G CGUGAAAU 


1985 


ATTTCACG GGCTAGCTACAACGA TGGGAGCA 


4313 


3005 


CUCCCAGC G UGAAAUGC 


1986 


GCATTTCA GGCTAGCTACAACGA GCTGGGAG 


4314 


3010 


AGCGUGAA A UGCACCUG 


1987 


CAGGTGCA GGCTAGCTACAACGA TTCACGCT 


4315 


3012 


CGUGAAAU G CACCUGCC 


1988 


GGCAGGTG GGCTAGCTACAACGA ATTTCACG 


4316 


3014 


UGAAAUGC A CCUGCCAA 


1989 


TTGGCAGG GGCTAGCTACAACGA GCATTTCA 


4317 


3018 


AUGCACCU G CCAAGAAU 


1990 


ATTCTTGG GGCTAGCTACAACGA AGGTGCAT 


4318 


3025 


UGCCAAGA A UGGCAGAC 


1991 


GTCTGCCA GGCTAGCTACAACGA TCTTGGCA 


4319 


3028 


CAAGAAUG G CAGACAUA 


1992 


TATGTCTG GGCTAGCTACAACGA CATTCTTG 


4320 


3032 


AAUGGCAG A CAUAGGGA 


1993 


TCCCTATG GGCTAGCTACAACGA CTGCCATT 


4321 
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3034 


UGGCAGAC A UAGGGACC 


1994 


GGTCCCTA GGCTAGCTACAACGA GTCTGCCA


4322


3040 


ACAUAGGG A CCCCGCCU 


1995 


AGGCGGGG GGCTAGCTACAACGA CCCTATGT


4323


3045 


GGGACCCC G CCUCCUGG 


1996 


CCAGGAGG GGCTAGCTACAACGA GGGGTCCC


4324 


3054 


CCUCCUGG G CCUUCACA 


1997 


TGTGAAGG GGCTAGCTACAACGA CCAGGAGG


4325 


3060 


GGGCCUUC A CAUGCCCA 


1998 


TGGGCATG GGCTAGCTACAACGA GAAGGCCC


4326 


3062 


GCCUUCAC A UGCCCAGU 


1999 


ACTGGGCA GGCTAGCTACAACGA GTGAAGGC


4327 


3064 


CUUCACAU G CCCAGUUU 


2000 


AAACTGGG GGCTAGCTACAACGA ATGTGAAG


4328 


3069 


CAUGCCCA G UUUUCUUC 


2001 


GAAGAAAA GGCTAGCTACAACGA TGGGCATG


4329 


3079 


UUUCUUCG G CUCUGUGG 


2002 


CCACAGAG GGCTAGCTACAACGA CGAAGAAA


4330 


3084 


UCGGCUCU G UGGCCUGA 


2003 


TCAGGCCA GGCTAGCTACAACGA AGAGCCGA


4331 


3087 


GCUCUGUG G CCUGAAGC 


2004 


GCTTCAGG GGCTAGCTACAACGA CACAGAGC


4332 


3094 


GGCCUGAA G CGGUCUGU 


2005 


ACAGACCG GGCTAGCTACAACGA TTCAGGCC


4333 


3097 


CUGAAGCG G UCUGUGGA 


2006 


TCCACAGA GGCTAGCTACAACGA CGCTTCAG


4334 


3101


AGCGGUCU G UGGACCUU 


2007 


AAGGTCCA GGCTAGCTACAACGA AGACCGCT


4335 


3105 


GUCUGUGG A CCUUGGAA 


2008 


TTCCAAGG GGCTAGCTACAACGA CCACAGAC


4336 


3114 


CCUUGGAA G UAGGGCUC 


2009 


GAGCCCTA GGCTAGCTACAACGA TTCCAAGG


4337 


3119 


GAAGUAGG G CUCCAGCA 


2010 


TGCTGGAG GGCTAGCTACAACGA CCTACTTC


4338 


3125 


GGGCUCCA G CACCGACU 


2011 


AGTCGGTG GGCTAGCTACAACGA TGGAGCCC


4339 


3127 


GCUCCAGC A CCGACUGG 


2012 


CCAGTCGG GGCTAGCTACAACGA GCTGGAGC


4340 


3131 


CAGCACCG A CUGGCCUC 


2013 


GAGGCCAG GGCTAGCTACAACGA CGGTGCTG


4341 


3135 


ACCGACUG G CCUCAGGC 


2014 


GCCTGAGG GGCTAGCTACAACGA CAGTCGGT


4342 


3142 


GGCCUCAG G CCUCUGCC 


2015 


GGCAGAGG GGCTAGCTACAACGA CTGAGGCC


4343 


3148 


AGGCCUCU G CCUCAUUG 


2016 


CAATGAGG GGCTAGCTACAACGA AGAGGCCT


4344 


3153 


UCUGCCUC A UUGGUGGU 


2017 


ACCACCAA GGCTAGCTACAACGA GAGGCAGA


4345 


3157 


CCUCAUUG G UGGUCGGG 


2018 


CCCGACCA GGCTAGCTACAACGA CAATGAGG


4346 


3160 


CAUUGGUG G UCGGGUAG 


2019 


CTACCCGA GGCTAGCTACAACGA CACCAATG


4347 


3165 


GUGGUCGG G UAGCGGCC 


2020 


GGCCGCTA GGCTAGCTACAACGA CCGACCAC


4348 


3168 


GUCGGGUA G CGGCCAGU 


2021 


ACTGGCCG GGCTAGCTACAACGA TACCCGAC


4349 


3171 


GGGUAGCG G CCAGUAGG 


2022 


CCTACTGG GGCTAGCTACAACGA CGCTACCC


4350 


3175 


AGCGGCCA G UAGGGCGU 


2023 


ACGCCCTA GGCTAGCTACAACGA TGGCCGCT


4351 


3180 


CCAGUAGG G CGUGGGAG 


2024 


CTCCCACG GGCTAGCTACAACGA CCTACTGG


4352 


3182 


AGUAGGGC G UGGGAGCC 


2025 


GGCTCCCA GGCTAGCTACAACGA GCCCTACT


4353 


3188 


GCGUGGGA G CCUGGCCA 


2026 


TGGCCAGG GGCTAGCTACAACGA TCCCACGC


4354 


3193 


GGAGCCUG G CCAUCCCU 


2027 


AGGGATGG GGCTAGCTACAACGA CAGGCTCC


4355 


3196 


GCCUGGCC A UCCCUGCC 


2028 


GGCAGGGA GGCTAGCTACAACGA GGCCAGGC


4356 


3202 


CCAUCCCU G CCUCCUGG 


2029 


CCAGGAGG GGCTAGCTACAACGA AGGGATGG


4357 


3212 


CUCCUGGA G UGGACGAG 


2030 


CTCGTCCA GGCTAGCTACAACGA TCCAGGAG


4358


3216 


UGGAGUGG A CGAGGUUG 


2031 


CAACCTCG GGCTAGCTACAACGA CCACTCCA


4359 


3221 


UGGACGAG G UUGGCAGC 


2032 


GCTGCCAA GGCTAGCTACAACGA CTCGTCCA


4360 


3225 


CGAGGUUG G CAGCUGGU 


2033 


ACCAGCTG GGCTAGCTACAACGA CAACCTCG


4361 


3228 


GGUUGGCA G CUGGUCCG 


2034 


CGGACCAG GGCTAGCTACAACGA TGCCAACC


4362 


3232 


GGCAGCUG G UCCGUCUG


2035 


CAGACGGA GGCTAGCTACAACGA CAGCTGCC


4363 


3236 


GCUGGUCC G UCUGCUCC 


2036 


GGAGCAGA GGCTAGCTACAACGA GGACCAGC


4364 


3240


GUCCGUCU G CUCCUGCC


2037 


GGCAGGAG GGCTAGCTACAACGA AGACGGAC 


4365 


3246 


CUGCUCCU G CCCCACUC 


2038 


GAGTGGGG GGCTAGCTACAACGA AGGAGCAG 


4366 


3251 


CCUGCCCC A CUCUCCCC 


2039 


GGGGAGAG GGCTAGCTACAACGA GGGGCAGG 


4367 


3261 


UCUCCCCC G CCCCUGCC 


2040 


GGCAGGGG GGCTAGCTACAACGA GGGGGAGA 


4368 


3267 


CCGCCCCU G CCCUCACC 


2041 


GGTGAGGG GGCTAGCTACAACGA AGGGGCGG 


4369 


3273 


CUGCCCUC A CCCUACCC 


2042 


GGGTAGGG GGCTAGCTACAACGA GAGGGCAG 


4370 


3278 


CUCACCCU A CCCUUGCC 


2043 


GGCAAGGG GGCTAGCTACAACGA AGGGTGAG 


4371 


3284 


CUACCCUU G CCCCACGC 


2044 


GCGTGGGG GGCTAGCTACAACGA AAGGGTAG 


4372


3289 


CUUGCCCC A CGCCUGCC 


2045 


GGCAGGCG GGCTAGCTACAACGA GGGGCAAG 


4373
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3291 


UGCCCCAC G CCUGCCUC 


2046


GAGGCAGG GGCTAGCTACAACGA GTGGGGCA


4374 


3295 


CCACGCCU G CCUCAUGG 


2047


CCATGAGG GGCTAGCTACAACGA AGGCGTGG 


4375 


3300 


CCUGCCUC A UGGCUGGU 


2048 


ACCAGCCA GGCTAGCTACAACGA GAGGCAGG 


4376 


3303 


GCCUCAUG G CUGGUUGC 


2049


GCAACCAG GGCTAGCTACAACGA CATGAGGC 


4377 


3307 


CAUGGCUG G UUGCUCUU 


2050 


AAGAGCAA GGCTAGCTACAACGA CAGCCATG 


4378 


3310 


GGCUGGUU G CUCUUGGA 


2051 


TCCAAGAG GGCTAGCTACAACGA AACCAGCC 


4379 


3319 


CUCUUGGA G CCUGGUAG 


2052


CTACCAGG GGCTAGCTACAACGA TCCAAGAG 


4380 


3324 


GGAGCCUG G UAGUGUCA 


2053


TGACACTA GGCTAGCTACAACGA CAGGCTCC 


4381 


3327


GCCUGGUA G UGUCACUG 


2054 


CAGTGACA GGCTAGCTACAACGA TACCAGGC 


4382 


3329 


CUGGUAGU G UCACUGGC 


2055 


GCCAGTGA GGCTAGCTACAACGA ACTACCAG 


4383 


3332 


GUAGUGUC A CUGGCUCA 


2056 


TGAGCCAG GGCTAGCTACAACGA GACACTAC 


4384 


3336 


UGUCACUG G CUCAGCCU 


2057 


AGGCTGAG GGCTAGCTACAACGA CAGTGACA


4385 


3341 


CUGGCUCA G CCUUGCUG 


2058 


CAGCAAGG GGCTAGCTACAACGA TGAGCCAG 


4386 


3346 


UCAGCCUU G CUGGGUAU 


2059 


ATACCCAG GGCTAGCTACAACGA AAGGCTGA 


4387 


3351 


CUUGCUGG G UAUACACA 


2060 


TGTGTATA GGCTAGCTACAACGA CCAGCAAG 


4388 


3353


UGCUGGGU A UACACAGG 


2061 


CCTGTGTA GGCTAGCTACAACGA ACCCAGCA


4389 


3355 


CUGGGUAU A CACAGGCU 


2062 


AGCCTGTG GGCTAGCTACAACGA ATACCCAG


4390 


3357 


GGGUAUAC A CAGGCUCU 


2063 


AGAGCCTG GGCTAGCTACAACGA GTATACCC 


4391 


3361 


AUACACAG G CUCUGCCA 


2064 


TGGCAGAG GGCTAGCTACAACGA CTGTGTAT 


4392 


3366 


CAGGCUCU G CCACCCAC 


2065 


GTGGGTGG GGCTAGCTACAACGA AGAGCCTG 
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4393 


3369 


GCUCUGCC A CCCACUCU 


2066 


AGAGTGGG GGCTAGCTACAACGA GGCAGAGC 


4394 


3373 


UGCCACCC A CUCUGCUC 


2067 


GAGCAGAG GGCTAGCTACAACGA GGGTGGCA 


4395 


3378 


CCCACUCU G CUCCAAGG 


2068 


CCTTGGAG GGCTAGCTACAACGA AGAGTGGG 


4396 


3388 


UCCAAGGG G CUUGCCCU 


2069 


AGGGCAAG GGCTAGCTACAACGA CCCTTGGA 


4397 


3392 


AGGGGCUU G CCCUGCCU 


2070 


AGGCAGGG GGCTAGCTACAACGA AAGCCCCT 


4398


3397 


CUUGCCCU G CCUUGGGC 


2071 


GCCCAAGG GGCTAGCTACAACGA AGGGCAAG 



4399 


3404 


UGCCUUGG G CCAAGUUC 


2072 


GAACTTGG GGCTAGCTACAACGA CCAAGGCA 


4400 


3409 


UGGGCCAA G UUCUAGGU 


2073 


ACCTAGAA GGCTAGCTACAACGA TTGGCCCA 



4401 


3416 


AGUUCUAG G UCUGGCCA 


2074


TGGCCAGA GGCTAGCTACAACGA CTAGAACT 


4402 


3421 


UAGGUCUG G CCACAGCC 


2075


GGCTGTGG GGCTAGCTACAACGA CAGACCTA 


4403 


3424 


GUCUGGCC A CAGCCACA 


2076


TGTGGCTG GGCTAGCTACAACGA GGCCAGAC 


4404 


3427 


UGGCCACA G CCACAGAC 


2077


GTCTGTGG GGCTAGCTACAACGA TGTGGCCA 


4405 


3430 


CCACAGCC A CAGACAGC 


2078


GCTGTCTG GGCTAGCTACAACGA GGCTGTGG 


4406 


3434 


AGCCACAG A CAGCUCAG 


2079


CTGAGCTG GGCTAGCTACAACGA CTGTGGCT 


4407 


3437


CACAGACA G CUCAGUCC 


2080


GGACTGAG GGCTAGCTACAACGA TGTCTGTG
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4408 


3442 


ACAGCUCA G UCCCCUGU 


2081


ACAGGGGA GGCTAGCTACAACGA TGAGCTGT 


4409 


3449 


AGUCCCCU G UGUGGUCA 


2082


TGACCACA GGCTAGCTACAACGA AGGGGACT 


4410 


3451 


UCCCCUGU G UGGUCAUC 


2083


GATGACCA GGCTAGCTACAACGA ACAGGGGA 


4411 


3454 


CCUGUGUG G UCAUCCUG 


2084


CAGGATGA GGCTAGCTACAACGA CACACAGG 


4412 


3457 


GUGUGGUC A UCCUGGCU 


2085 


AGCCAGGA GGCTAGCTACAACGA GACCACAC 


4413 


3463 


UCAUCCUG G CUUCUGCU 


2086


AGCAGAAG GGCTAGCTACAACGA CAGGATGA 


4414 


3469 


UGGCUUCU G CUGGGGGC 


2087


GCCCCCAG GGCTAGCTACAACGA AGAAGCCA 


4415 


3476 


UGCUGGGG G CCCACAGC 


2088


GCTGTGGG GGCTAGCTACAACGA CCCCAGCA 



4416 


3480 


GGGGGCCC A CAGCGCCC 


2089 


GGGCGCTG GGCTAGCTACAACGA GGGCCCCC 


4417


3483 


GGCCCACA G CGCCCCUG 


2090 


CAGGGGCG GGCTAGCTACAACGA TGTGGGCC 


4418 


3485 


CCCACAGC G CCCCUGGU 


2091 


ACCAGGGG GGCTAGCTACAACGA GCTGTGGG 


4419 


3492 


CGCCCCUG G UGCCCCUC 


2092 


GAGGGGCA GGCTAGCTACAACGA CAGGGGCG 


4420 


3494 


CCCCUGGU G CCCCUCCC


2093 


GGGAGGGG GGCTAGCTACAACGA ACCAGGGG 


4421 


3511 


CUCCCAGG G CCCGGGUU 


2094 


AACCCGGG GGCTAGCTACAACGA CCTGGGAG 


4422 


3517 


GGGCCCGG G UUGAGGCU 


2095 


AGCCTCAA GGCTAGCTACAACGA CCGGGCCC 


4423 


3523 


GGGUUGAG G CUGGGCCA 


2096 


TGGCCCAG GGCTAGCTACAACGA CTCAACCC 


4424 


3528 


GAGGCUGG G CCAGGCCC 


2097 


GGGCCTGG GGCTAGCTACAACGA CCAGCCTC 


4425 
3533 | UGGGCCAG G CCCUCUGG | 2098 | CCAGAGGG GGCTAGCTACAACGA CTGGCCCA | 4426
3543 | CCUCUGGG A CGGGGACU | 2099 | AGTCCCCG GGCTAGCTACAACGA CCCAGAGG | 4427
3549 | GGACGGGG A CUUGUGCC | 2100 | GGCACAAG GGCTAGCTACAACGA CCCCGTCC | 4428
3553 | GGGGACUU G UGCCCUGU | 2101 | ACAGGGCA GGCTAGCTACAACGA AAGTCCCC | 4429
3555 | GGACUUGU G CCCUGUCA | 2102 | TGACAGGG GGCTAGCTACAACGA ACAAGTCC | 4430
3560 | UGUGCCCU G UCAGGGUU | 2103 | AACCCTGA GGCTAGCTACAACGA AGGGCACA | 4431
3566 | CUGUCAGG G UUCCCUAU | 2104 | ATAGGGAA GGCTAGCTACAACGA CCTGACAG | 4432
3573 | GGUUCCCU A UCCCUGAG | 2105 | CTCAGGGA GGCTAGCTACAACGA AGGGAACC | 4433



3582 | UCCCUGAG G UUGGGGGA | 2106 | TCCCCCAA GGCTAGCTACAACGA CTCAGGGA | 4434
3593 | GGGGGAGA G CUAGCAGG | 2107 | CCTGCTAG GGCTAGCTACAACGA TCTCCCCC | 4435
3597 | GAGAGCUA G CAGGGCAU | 2108 | ATGCCCTG GGCTAGCTACAACGA TAGCTCTC |
3602 | CUAGCAGG G CAUGCCGC | 2109 | GCGGCATG GGCTAGCTACAACGA CCTGCTAG |
3604 | AGCAGGGC A UGCCGCUG | 2110 | CAGCGGCA GGCTAGCTACAACGA GCCCTGCT |
3606 | CAGGGCAU G CCGCUGGC | 2111 | GCCAGCGG GGCTAGCTACAACGA ATGCCCTG |
3609 | GGCAUGCC G CUGGCUGG | 2112 | CCAGCCAG GGCTAGCTACAACGA GGCATGCC |
3613 | UGCCGCUG G CUGGCCAG | 2113 | CTGGCCAG GGCTAGCTACAACGA CAGCGGCA |
3617 | GCUGGCUG G CCAGGGCU | 2114 | AGCCCTGG GGCTAGCTACAACGA CAGCCAGC |
3623 | UGGCCAGG G CUGCAGGG | 2115 | CCCTGCAG GGCTAGCTACAACGA CCTGGCCA |
3626 | CCAGGGCU G CAGGGACA | 2116 | TGTCCCTG GGCTAGCTACAACGA AGCCCTGG |
3632 | CUGCAGGG A CACUCCCC | 2117 | GGGGAGTG GGCTAGCTACAACGA CCCTGCAG |
3634 | GCAGGGAC A CUCCCCCU | 2118 | AGGGGGAG GGCTAGCTACAACGA GTCCCTGC |
3646 | CCCCUUUU G UCCAGGGA | 2119 | TCCCTGGA GGCTAGCTACAACGA AAAAGGGG |
3655 | UCCAGGGA A UACCACAC | 2120 | GTGTGGTA GGCTAGCTACAACGA TCCCTGGA |
3657 | CAGGGAAU A CCACACUC | 2121 | GAGTGTGG GGCTAGCTACAACGA ATTCCCTG |
3660 | GGAAUACC A CACUCGCC | 2122 | GGCGAGTG GGCTAGCTACAACGA GGTATTCC |
3662 | AAUACCAC A CUCGCCCU | 2123 | AGGGCGAG GGCTAGCTACAACGA GTGGTATT |
3666 | CCACACUC G CCCUUCUC | 2124 | GAGAAGGG GGCTAGCTACAACGA GAGTGTGG |
3679 | UCUCUCCA G CGAACACC | 2125 | GGTGTTCG GGCTAGCTACAACGA TGGAGAGA |
2155


TCGGGAAG GGCTAGCTACAACGA CTATCGAT 


4482 


4479 


UUCCCGAG G CCAGGAGC 




GCTCCTGG GGCTAGCTACAACGA CTCGGGAA 


4483 


4486 


GGCCAGGA G CCCUCUAG 


2157


CTAGAGGG GGCTAGCTACAACGA TCCTGGCC 


4484 


4496 


CCUCUAGG G CUGCCGGG 


2158


CCCGGCAG GGCTAGCTACAACGA CCTAGAGG 


4485 


4499 


CUAGGGCU G CCGGGUGC


2159


GCACCCGG GGCTAGCTACAACGA AGCCCTAG 


4486 


4504 


GCUGCCGG G UGCCACCC 


2160


GGGTGGCA GGCTAGCTACAACGA CCGGCAGC 


4487 


4506 


UGCCGGGU G CCACCCUG 


2161


CAGGGTGG GGCTAGCTACAACGA ACCCGGCA 


4488 


4509 


CGGGUGCC A CCCUGGCU


2162


AGCCAGGG GGCTAGCTACAACGA GGCACCCG 


4489 


4515 


CCACCCUG G CUCCUUCC 




GGAAGGAG GGCTAGCTACAACGA CAGGGTGG 


4490 


4524 


CUCCUUCC A CACCGUGC 


2164


GCACGGTG GGCTAGCTACAACGA GGAAGGAG


4491 


4526 


CCUUCCAC A CCGUGCUG 


2165


CAGCACGG GGCTAGCTACAACGA GTGGAAGG


4492 


4529 


UCCACACC G UGCUGGUC 


2166


GACCAGCA GGCTAGCTACAACGA GGTGTGGA 


4493 


4531 


CACACCGU G CUGGUCAC 


2167


GTGACCAG GGCTAGCTACAACGA ACGGTGTG


4494 


4535 


CCGUGCUG G UCACUGCC 


2168 


GGCAGTGA GGCTAGCTACAACGA CAGCACGG


4495 


4538 


UGCUGGUC A CUGCCUGC 


2169


GCAGGCAG GGCTAGCTACAACGA GACCAGCA


4496 


4541 


UGGUCACU G CCUGCUGG 


2170 


CCAGCAGG GGCTAGCTACAACGA AGTGACCA 


4497 


4545 


CACUGCCU G CUGGGGGC 


2171 


GCCCCCAG GGCTAGCTACAACGA AGGCAGTG


4498 


4552 


UGCUGGGG G CGUCAGAU 


2172


ATCTGACG GGCTAGCTACAACGA CCCCAGCA


4499 


4554 


CUGGGGGC G UCAGAUGC 


2173


GCATCTGA GGCTAGCTACAACGA GCCCCCAG


4500 


4559 


GGCGUCAG A UGCAGGUG


2174 


CACCTGCA GGCTAGCTACAACGA CTGACGCC


4501 


4561 


CGUCAGAU G CAGGUGAC 


2175


GTCACCTG GGCTAGCTACAACGA ATCTGACG


4502 


4565 


AGAUGCAG G UGACCCUG 


2176


CAGGGTCA GGCTAGCTACAACGA CTGCATCT 


4503 


4568 


UGCAGGUG A CCCUGUGC 


2177


GCACAGGG GGCTAGCTACAACGA CACCTGCA 


4504 


4573 


GUGACCCU G UGCAGGAG 


2178


CTCCTGCA GGCTAGCTACAACGA AGGGTCAC 


4505 


4575 


GACCCUGU G CAGGAGGU 


2179


ACCTCCTG GGCTAGCTACAACGA ACAGGGTC


4506 


4582 


UGCAGGAG G UAUCUCUG 


2180


CAGAGATA GGCTAGCTACAACGA CTCCTGCA 


4507 


4584 


CAGGAGGU A UCUCUGGA 


2181


TCCAGAGA GGCTAGCTACAACGA ACCTCCTG 


4508 


4592 


AUCUCUGG A CCUGCCUC 


2182


GAGGCAGG GGCTAGCTACAACGA CCAGAGAT 


4509 


4596 


CUGGACCU G CCUCUUGG 


2183


CCAAGAGG GGCTAGCTACAACGA AGGTCCAG 


4510 


4604 


GCCUCUUG G UCAUUACG


2184


CGTAATGA GGCTAGCTACAACGA CAAGAGGC 


4511 


4607 


UCUUGGUC A UUACGGGG 


2185


CCCCGTAA GGCTAGCTACAACGA GACCAAGA 


4512 


4610 


UGGUCAUU A CGGGGCUG 


2186


CAGCCCCG GGCTAGCTACAACGA AATGACCA 


4513 


4615 


AUUACGGG G CUGGGCAG 


2187


CTGCCCAG GGCTAGCTACAACGA CCCGTAAT 


4514 


4620 


GGGGCUGG G CAGGGCCU


2188


AGGCCCTG GGCTAGCTACAACGA CCAGCCCC 


4515 


4625 


UGGGCAGG G CCUGGUAU 


2189


ATACCAGG GGCTAGCTACAACGA CCTGCCCA 


4516 


4630 


AGGGCCUG G UAUCAGGG 


2190 


CCCTGATA GGCTAGCTACAACGA CAGGCCCT 


4517 


4632 


GGCCUGGU A UCAGGGCC 


2191 


GGCCCTGA GGCTAGCTACAACGA ACCAGGCC 


4518 


4638 


GUAUCAGG G CCCCGCUG 


2192 


CAGCGGGG GGCTAGCTACAACGA CCTGATAC 


4519 


4643 


AGGGCCCC G CUGGGGUU 


2193 


AACCCCAG GGCTAGCTACAACGA GGGGCCCT 


4520 


4649 


CCGCUGGG G UUGCAGGG 


2194 


CCCTGCAA GGCTAGCTACAACGA CCCAGCGG 


4521 


4652 


CUGGGGUU G CAGGGCUG 


2195 


CAGCCCTG GGCTAGCTACAACGA AACCCCAG 


4522 


4657 


GUUGCAGG G CUGGGCCU 


2196 


AGGCCCAG GGCTAGCTACAACGA CCTGCAAC 


4523 


4662 


AGGGCUGG G CCUGUGCU 


2197 


AGCACAGG GGCTAGCTACAACGA CCAGCCCT 


4524 


4666 


CUGGGCCU G UGCUGUGG 


2198 


CCACAGCA GGCTAGCTACAACGA AGGCCCAG 


4525 
4668 | GGGCCUGU G CUGUGGUC | 2199 | GACCACAG GGCTAGCTACAACGA ACAGGCCC | 4526
4671 | CCUGUGCU G UGGUCCUG | 2200 | CAGGACCA GGCTAGCTACAACGA AGCACAGG | 4527
4674 | GUGCUGUG G UCCUGGGG | 2201 | CCCCAGGA GGCTAGCTACAACGA CACAGCAC | 4528
4682 | GUCCUGGG G UGUCCAGG | 2202 | CCTGGACA GGCTAGCTACAACGA CCCAGGAC | 4529
4684 | CCUGGGGU G UCCAGGAC | 2203 | GTCCTGGA GGCTAGCTACAACGA ACCCCAGG | 4530
4691 | UGUCCAGG A CAGACGUG | 2204 | CACGTCTG GGCTAGCTACAACGA CCTGGACA | 4531
4695 | CAGGACAG A CGUGGAGG | 2205 | CCTCCACG GGCTAGCTACAACGA CTGTCCTG | 4532
4697 | GGACAGAC G UGGAGGGG | 2206 | CCCCTCCA GGCTAGCTACAACGA GTCTGTCC | 4533
4705 | GUGGAGGG G UCAGGGCC | 2207 | GGCCCTGA GGCTAGCTACAACGA CCCTCCAC | 4534
4711 | GGGUCAGG G CCCAGCAC | 2208 | GTGCTGGG GGCTAGCTACAACGA CCTGACCC | 4535
4716 | AGGGCCCA G CACCCCUG | 2209 | CAGGGGTG GGCTAGCTACAACGA TGGGCCCT | 4536
4718 | GGCCCAGC A CCCCUGCU | 2210 | AGCAGGGG GGCTAGCTACAACGA GCTGGGCC | 4537
4724 | GCACCCCU G CUCCAUGC | 2211 | GCATGGAG GGCTAGCTACAACGA AGGGGTGC | 4538
4729 | CCUGCUCC A UGCUGAAC | 2212 | GTTCAGCA GGCTAGCTACAACGA GGAGCAGG | 4539
4731 | UGCUCCAU G CUGAACUG | 2213 | CAGTTCAG GGCTAGCTACAACGA ATGGAGCA | 4540
4736 | CAUGCUGA A CUGUGGGA | 2214 | TCCCACAG GGCTAGCTACAACGA TCAGCATG | 4541
4739 | GCUGAACU G UGGGAAGC | 2215 | GCTTCCCA GGCTAGCTACAACGA AGTTCAGC | 4542
4746 | UGUGGGAA G CAUCCAGG | 2216 | CCTGGATG GGCTAGCTACAACGA TTCCCACA | 4543
4748 | UGGGAAGC A UCCAGGUC | 2217 | GACCTGGA GGCTAGCTACAACGA GCTTCCCA | 4544
4754 | GCAUCCAG G UCCCUGGG | 2218 | CCCAGGGA GGCTAGCTACAACGA CTGGATGC | 4545
4762 | GUCCCUGG G UGGCUUCA | 2219 | TGAAGCCA GGCTAGCTACAACGA CCAGGGAC | 4546
4765 | CCUGGGUG G CUUCAACA | 2220 | TGTTGAAG GGCTAGCTACAACGA CACCCAGG | 4547
4771 | UGGCUUCA A CAGGAGUU | 2221 | AACTCCTG GGCTAGCTACAACGA TGAAGCCA | 4548
4777 | CAACAGGA G UUCCAGCA | 2222 | TGCTGGAA GGCTAGCTACAACGA TCCTGTTG | 4549
4783 | GAGUUCCA G CACGGGAA | 2223 | TTCCCGTG GGCTAGCTACAACGA TGGAACTC | 4550
4785 | GUUCCAGC A CGGGAACC | 2224 | GGTTCCCG GGCTAGCTACAACGA GCTGGAAC | 4551
4791 | GCACGGGA A CCACUGGA | 2225 | TCCAGTGG GGCTAGCTACAACGA TCCCGTGC | 4552
4794 | CGGGAACC A CUGGACAA | 2226 | TTGTCCAG GGCTAGCTACAACGA GGTTCCCG | 4553
4799 | ACCACUGG A CAACCUGG | 2227 | CCAGGTTG GGCTAGCTACAACGA CCAGTGGT | 4554
4802 | ACUGGACA A CCUGGGGU | 2228 | ACCCCAGG GGCTAGCTACAACGA TGTCCAGT | 4555
4809 | AACCUGGG G UGUGUCCU | 2229 | AGGACACA GGCTAGCTACAACGA CCCAGGTT | 4556
4811 | CCUGGGGU G UGUCCUGA | 2230 | TCAGGACA GGCTAGCTACAACGA ACCCCAGG | 4557
4813 | UGGGGUGU G UCCUGAUC | 2231 | GATCAGGA GGCTAGCTACAACGA ACACCCCA | 4558
4819 | GUGUCCUG A UCUGGGGA | 2232 | TCCCCAGA GGCTAGCTACAACGA CAGGACAC | 4559
4827 | AUCUGGGG A CAGGCCAG | 2233 | CTGGCCTG GGCTAGCTACAACGA CCCCAGAT | 4560
4831 | GGGGACAG G CCAGCCAC | 2234 | GTGGCTGG GGCTAGCTACAACGA CTGTCCCC | 4561
4835 | ACAGGCCA G CCACACCC | 2235 | GGGTGTGG GGCTAGCTACAACGA TGGCCTGT | 4562
4838 | GGCCAGCC A CACCCCGA | 2236 | TCGGGGTG GGCTAGCTACAACGA GGCTGGCC | 4563
4840 | CCAGCCAC A CCCCGAGU | 2237 | ACTCGGGG GGCTAGCTACAACGA GTGGCTGG | 4564
4847 | CACCCCGA G UCCUAGGG | 2238 | CCCTAGGA GGCTAGCTACAACGA TCGGGGTG | 4565
4856 | UCCUAGGG A CUCCAGAG | 2239 | CTCTGGAG GGCTAGCTACAACGA CCCTAGGA | 4566
4866 | UCCAGAGA G CAGCCCAC | 2240 | GTGGGCTG GGCTAGCTACAACGA TCTCTGGA | 4567
4869 | AGAGAGCA G CCCACUGC | 2241 | GCAGTGGG GGCTAGCTACAACGA TGCTCTCT | 4568
4873 | AGCAGCCC A CUGCCCUG | 2242 | CAGGGCAG GGCTAGCTACAACGA GGGCTGCT | 4569
4876 | AGCCCACU G CCCUGGGC | 2243 | GCCCAGGG GGCTAGCTACAACGA AGTGGGCT | 4570
4883 | UGCCCUGG G CUCCACGG | 2244 | CCGTGGAG GGCTAGCTACAACGA CCAGGGCA | 4571
4888 | UGGGCUCC A CGGAAGCC | 2245 | GGCTTCCG GGCTAGCTACAACGA GGAGCCCA | 4572
4894 | CCACGGAA G CCCCCUCA | 2246 | TGAGGGGG GGCTAGCTACAACGA TTCCGTGG | 4573
4902 | GCCCCCUC A UGCCGCUA | 2247 | TAGCGGCA GGCTAGCTACAACGA GAGGGGGC | 4574
4904 | CCCCUCAU G CCGCUAGG | 2248 | CCTAGCGG GGCTAGCTACAACGA ATGAGGGG | 4575
4907 | CUCAUGCC G CUAGGCCU | 2249 | AGGCCTAG GGCTAGCTACAACGA GGCATGAG | 4576
4912 | GCCGCUAG G CCUUGGCC | 2250 | GGCCAAGG GGCTAGCTACAACGA CTAGCGGC | 4577



WO 02/097114 
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4918 


AGGCCUUG G CCUCGGGG 


2251 


CCCCGAGG GGCTAGCTACAACGA CAAGGCCT


4578 


4927 


CCUCGGGG A CAGCCCAG 


2252 


CTGGGCTG GGCTAGCTACAACGA CCCCGAGG


4579 


4930 


CGGGGACA G CCCAGCUA 


2253 


TAGCTGGG GGCTAGCTACAACGA TGTCCCCG 


4580


4935 


ACAGCCCA G CUAGGCCA 


2254 


TGGCCTAG GGCTAGCTACAACGA TGGGCTGT 


4581 


4940 


CCAGCUAG G CCAGUGUG 


2255 


CACACTGG GGCTAGCTACAACGA CTAGCTGG 



4582 


4944 


CUAGGCCA G UGUGUGGC 


2256 


GCCACACA GGCTAGCTACAACGA TGGCCTAG 


4583 


4946 


AGGCCAGU G UGUGGCAG 


2257 


CTGCCACA GGCTAGCTACAACGA ACTGGCCT



4584 


4948 


GCCAGUGU G UGGCAGGA 


2258 


TCCTGCCA GGCTAGCTACAACGA ACACTGGC 


4585 


4951 


AGUGUGUG G CAGGACCA 


2259 


TGGTCCTG GGCTAGCTACAACGA CACACACT 


4586 


4956 


GUGGCAGG A CCAGGCCC 


2260 


GGGCCTGG GGCTAGCTACAACGA CCTGCCAC 


4587 


4961 


AGGACCAG G CCCCCAUG 


2261 


CATGGGGG GGCTAGCTACAACGA CTGGTCCT 


4588 


4967 


AGGCCCCC A UGUGGGAG 


2262 


CTCCCACA GGCTAGCTACAACGA GGGGGCCT 


4589 


4969 


GCCCCCAU G UGGGAGCU 


2263 


AGCTCCCA GGCTAGCTACAACGA ATGGGGGC 


4590 


4975 


AUGUGGGA G CUGACCCC 


2264 


GGGGTCAG GGCTAGCTACAACGA TCCCACAT 



4591


4979 


GGGAGCUG A CCCCUUGG 


2265 


CCAAGGGG GGCTAGCTACAACGA CAGCTCCC 


4592 


4989 


CCCUUGGG A UUCUGGAG 


2266 


CTCCAGAA GGCTAGCTACAACGA CCCAAGGG 


4593 


4997 


AUUCUGGA G CUGUGCUG 


2267 


CAGCACAG GGCTAGCTACAACGA TCCAGAAT 


4594 


5000 


CUGGAGCU G UGCUGAUG 


2268 


CATCAGCA GGCTAGCTACAACGA AGCTCCAG 


4595 


5002 


GGAGCUGU G CUGAUGGG 


2269 


CCCATCAG GGCTAGCTACAACGA ACAGCTCC


4596 


5006 


CUGUGCUG A UGGGCAGG 


2270 


CCTGCCCA GGCTAGCTACAACGA CAGCACAG


4597 


5010 


GCUGAUGG G CAGGGGAG 


2271 


CTCCCCTG GGCTAGCTACAACGA CCATCAGC 


4598 


5020 


AGGGGAGA G CCAGCUCC 


2272 


GGAGCTGG GGCTAGCTACAACGA TCTCCCCT


4599 


5024 


GAGAGCCA G CUCCUCCC 


2273 


GGGAGGAG GGCTAGCTACAACGA TGGCTCTC 


4600 


5044 


GAGGGAGG G UCUUGAUG 


2274 


CATCAAGA GGCTAGCTACAACGA CCTCCCTC 


4601 


5050 


GGGUCUUG A UGCCUGGG 


2275 


CCCAGGCA GGCTAGCTACAACGA CAAGACCC 


4602 


5052 


GUCUUGAU G CCUGGGGU 


2276 


ACCCCAGG GGCTAGCTACAACGA ATCAAGAC 



4603 


5059 


UGCCUGGG G UUACCCGC 


2277 


GCGGGTAA GGCTAGCTACAACGA CCCAGGCA 


4604 


5062 


CUGGGGUU A CCCGCAGA 


2278 


TCTGCGGG GGCTAGCTACAACGA AACCCCAG 


4605 


5066 


GGUUACCC G CAGAGGCC 


2279 


GGCCTCTG GGCTAGCTACAACGA GGGTAACC 


4606 


5072 


CCGCAGAG G CCUGGGUG 


2280 


CACCCAGG GGCTAGCTACAACGA CTCTGCGG 



4607 


5078 


AGGCCUGG G UGCCGGGA 


2281 


TCCCGGCA GGCTAGCTACAACGA CCAGGCCT 


4608 


5080 


GCCUGGGU G CCGGGACG 


2282 


CGTCCCGG GGCTAGCTACAACGA ACCCAGGC 


4609 


5086 


GUGCCGGG A CGCUCCCC 


2283 


GGGGAGCG GGCTAGCTACAACGA CCCGGCAC 


4610 


5088 


GCCGGGAC G CUCCCCGG 


2284 


CCGGGGAG GGCTAGCTACAACGA GTCCCGGC 


4611 


5096 


GCUCCCCG G UUUGGCUG


2285 


CAGCCAAA GGCTAGCTACAACGA CGGGGAGC 


4612 


5101 


CCGGUUUG G CUGAAAGG 


2286 


CCTTTCAG GGCTAGCTACAACGA CAAACCGG 



4613 


5113 


AAAGGAAA G CAGAUGUG


2287 


CACATCTG GGCTAGCTACAACGA TTTCCTTT 


4614 


5117 


GAAAGCAG A UGUGGUCA 


2288 


TGACCACA GGCTAGCTACAACGA CTGCTTTC 



4615 


5119 


AAGCAGAU G UGGUCAGC 


2289 


GCTGACCA GGCTAGCTACAACGA ATCTGCTT 


4616 


5122 


CAGAUGUG G UCAGCUUC 


2290 


GAAGCTGA GGCTAGCTACAACGA CACATCTG 



4617 


5126 


UGUGGUCA G CUUCUCCA 


2291 


TGGAGAAG GGCTAGCTACAACGA TGACCACA 



4618 


5134 


GCUUCUCC A CUGAGCCC 


2292 


GGGCTCAG GGCTAGCTACAACGA GGAGAAGC 



4619 


5139 


UCCACUGA G CCCAUCUG 


2293 


CAGATGGG GGCTAGCTACAACGA TCAGTGGA 


4620 


5143 


CUGAGCCC A UCUGGUCU 


2294 


AGACCAGA GGCTAGCTACAACGA GGGCTCAG 




5148 


CCCAUCUG G UCUUCCCG 


2295 


CGGGAAGA GGCTAGCTACAACGA CAGATGGG 


4622 


5159 


UUCCCGGG G CUGGGCCC 


2296 


GGGCCCAG GGCTAGCTACAACGA CCCGGGAA 


4623 


5164 


GGGGCUGG G CCCCAUAG 


2297 


CTATGGGG GGCTAGCTACAACGA CCAGCCCC 


4624 


5169 


UGGGCCCC A UAGAUCUG 


2298 


CAGATCTA GGCTAGCTACAACGA GGGGCCCA 


4625 


5173 


CCCCAUAG A UCUGGGUC 


2299 


GACCCAGA GGCTAGCTACAACGA CTATGGGG 


4626 


5179 


AGAUCUGG G UCCCUGUG 


2300 


CACAGGGA GGCTAGCTACAACGA CCAGATCT 


4627 


5185 


GGGUCCCU G UGUGGCCC 


2301 


GGGCCACA GGCTAGCTACAACGA AGGGACCC 


4628 


5187 


GUCCCUGU G UGGCCCCC 


2302 


GGGGGCCA GGCTAGCTACAACGA ACAGGGAC 


4629 
5190 | CCUGUGUG G CCCCCCUG | 2303 | CAGGGGGG GGCTAGCTACAACGA CACACAGG | 4630
5199 | CCCCCCUG G UCUGAUGC | 2304 | GCATCAGA GGCTAGCTACAACGA CAGGGGGG | 4631
5204 | CUGGUCUG A UGCCGAGG | 2305 | CCTCGGCA GGCTAGCTACAACGA CAGACCAG | 4632
5206 | GGUCUGAU G CCGAGGAU | 2306 | ATCCTCGG GGCTAGCTACAACGA ATCAGACC | 4633
5213 | UGCCGAGG A UACCCCUG | 2307 | CAGGGGTA GGCTAGCTACAACGA CCTCGGCA | 4634
5215 | CCGAGGAU A CCCCUGCA | 2308 | TGCAGGGG GGCTAGCTACAACGA ATCCTCGG | 4635
5221 | AUACCCCU G CAAACUGC | 2309 | GCAGTTTG GGCTAGCTACAACGA AGGGGTAT | 4636
5225 | CCCUGCAA A CUGCCAAU | 2310 | ATTGGCAG GGCTAGCTACAACGA TTGCAGGG | 4637
5228 | UGCAAACU G CCAAUCCC | 2311 | GGGATTGG GGCTAGCTACAACGA AGTTTGCA | 4638
5232 | AACUGCCA A UCCCAGAG | 2312 | CTCTGGGA GGCTAGCTACAACGA TGGCAGTT | 4639
5242 | CCCAGAGG A CAAGACUG | 2313 | CAGTCTTG GGCTAGCTACAACGA CCTCTGGG | 4640
5247 | AGGACAAG A CUGGGAAG | 2314 | CTTCCCAG GGCTAGCTACAACGA CTTGTCCT | 4641
5255 | ACUGGGAA G UCCCUGCA | 2315 | TGCAGGGA GGCTAGCTACAACGA TTCCCAGT | 4642
5261 | AAGUCCCU G CAGGGAGA | 2316 | TCTCCCTG GGCTAGCTACAACGA AGGGACTT | 4643
5270 | CAGGGAGA G CCCAUCCC | 2317 | GGGATGGG GGCTAGCTACAACGA TCTCCCTG | 4644
5274 | GAGAGCCC A UCCCCGCA | 2318 | TGCGGGGA GGCTAGCTACAACGA GGGCTCTC | 4645
5280 | CCAUCCCC G CACCCUGA | 2319 | TCAGGGTG GGCTAGCTACAACGA GGGGATGG | 4646
5282 | AUCCCCGC A CCCUGACC | 2320 | GGTCAGGG GGCTAGCTACAACGA GCGGGGAT | 4647
5288 | GCACCCUG A CCCACAAG | 2321 | CTTGTGGG GGCTAGCTACAACGA CAGGGTGC | 4648
5292 | CCUGACCC A CAAGAGGG | 2322 | CCCTCTTG GGCTAGCTACAACGA GGGTCAGG | 4649
5301 | CAAGAGGG A CUCCUGCU | 2323 | AGCAGGAG GGCTAGCTACAACGA CCCTCTTG | 4650
5307 | GGACUCCU G CUGCCCAC | 2324 | GTGGGCAG GGCTAGCTACAACGA AGGAGTCC | 4651
5310 | CUCCUGCU G CCCACCAG | 2325 | CTGGTGGG GGCTAGCTACAACGA AGCAGGAG | 4652
5314 | UGCUGCCC A CCAGGCAU | 2326 | ATGCCTGG GGCTAGCTACAACGA GGGCAGCA | 4653
5319 | CCCACCAG G CAUCCCUC | 2327 | GAGGGATG GGCTAGCTACAACGA CTGGTGGG | 4654
     | CACCAGGC A UCCCUCCA | 2328 | TGGAGGGA GGCTAGCTACAACGA GCCTGGTG | 4655



Input Sequence - HUMRasH_mRNA. Cut Site - R/Y 
Arm Length = 8. Core Sequence = GGCTAGCTACAACGA 
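The footer parameters suggest how each DNAzyme entry relates to its substrate: the two binding arms appear to be the DNA reverse complements of the eight nucleotides on either side of the unpaired purine, joined by the quoted core. The following is a minimal sketch under that assumption; the function names (`revcomp_dna`, `dnazyme_for`, `cut_sites`) are hypothetical and are not part of the specification.

```python
# Illustrative sketch only: reproduces the apparent construction rule of the
# table (arm length 8, R/Y cut site, core GGCTAGCTACAACGA). Names are hypothetical.
CORE = "GGCTAGCTACAACGA"  # core sequence quoted in the table footer
_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A", "T": "A"}


def revcomp_dna(seq):
    """Return the DNA reverse complement of an RNA (or DNA) sequence."""
    return "".join(_COMPLEMENT[base] for base in reversed(seq.upper()))


def dnazyme_for(substrate, arm=8):
    """Build a DNAzyme from a substrate written as "5'-arm R 3'-arm"."""
    five, cut, three = substrate.split()
    if len(five) != arm or len(three) != arm:
        raise ValueError("each arm must be %d nucleotides long" % arm)
    if cut not in "AG" or three[0] not in "CU":
        raise ValueError("cut site must be a purine followed by a pyrimidine")
    # The arm pairing with the 3' side of the substrate is written first.
    return "%s %s %s" % (revcomp_dna(three), CORE, revcomp_dna(five))


def cut_sites(rna, arm=8):
    """Yield (1-based position, substrate) for every R/Y site with full arms."""
    rna = rna.upper()
    for i in range(arm, len(rna) - arm):
        if rna[i] in "AG" and rna[i + 1] in "CU":
            yield i + 1, "%s %s %s" % (rna[i - arm:i], rna[i], rna[i + 1:i + 1 + arm])


print(dnazyme_for("GGCUCUCC A UAGCCUUU"))
# prints AAAGGCTA GGCTAGCTACAACGA GGAGAGCC
```

The printed result matches the entry listed above for position 2720, which is why the rule is assumed here; positions returned by `cut_sites` are relative to whatever fragment is passed in, not to the full mRNA.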
Table IV: Human HER2 DNAzyme and Substrate Sequence



Pos 


Substrate 


Seq 
ID 


DNAzyme 


Seq 
ID 


9 


AAGGGGAG G UAACCCUG 


4656 


CAGGGTTA GGCTAGCTACAACGA CTCCCCTT 


5644 


12 


GGGAGGUA A CCCUGGCC 


4657 


GGCCAGGG GGCTAGCTACAACGA TACCTCCC 


5645 


18 


UAACCCUG G CCCCUUUG 


4658 


CAAAGGGG GGCTAGCTACAACGA CAGGGTTA 


5646 


27 


CCCCUUUG G UCGGGGCC


4659 


GGCCCCGA GGCTAGCTACAACGA CAAAGGGG 


5647 


33 


UGGUCGGG G CCCCGGGC 


4660 


GCCCGGGG GGCTAGCTACAACGA CCCGACCA 


5648 


40


GGCCCCGG G CAGCCGCG 


4661 


CGCGGCTG GGCTAGCTACAACGA CCGGGGCC 


5649 


43


CCCGGGCA G CCGCGCGC 


4662 


GCGCGCGG GGCTAGCTACAACGA TGCCCGGG 


5650 


46 


GGGCAGCC G CGCGCCCC 


4663 


GGGGCGCG GGCTAGCTACAACGA GGCTGCCC 


5651 


48 


GCAGCCGC G CGCCCCUU


4664 


AAGGGGCG GGCTAGCTACAACGA GCGGCTGC 


5652 


50


AGCCGCGC G CCCCUUCC 


4665 


GGAAGGGG GGCTAGCTACAACGA GCGCGGCT 


5653 


60 


CCCUUCCC A CGGGGCCC 


4666 


GGGCCCCG GGCTAGCTACAACGA GGGAAGGG 


5654 


65 


CCCACGGG G CCCUUUAC 


4667 


GTAAAGGG GGCTAGCTACAACGA CCCGTGGG 


5655 


72 


GGCCCUUU A CUGCGCCG 


4668


CGGCGCAG GGCTAGCTACAACGA AAAGGGCC 


5656 


75 


CCUUUACU G CGCCGCGC 


4669 


GCGCGGCG GGCTAGCTACAACGA AGTAAAGG 


5657 


77 


UUUACUGC G CCGCGCGC 


4670 


GCGCGCGG GGCTAGCTACAACGA GCAGTAAA 


5658 


80 


ACUGCGCC G CGCGCCCG 


4671 


CGGGCGCG GGCTAGCTACAACGA GGCGCAGT 


5659 


82 


UGCGCCGC G CGCCCGGC 


4672 


GCCGGGCG GGCTAGCTACAACGA GCGGCGCA 


5660 


84 


CGCCGCGC G CCCGGCCC 


4673 


GGGCCGGG GGCTAGCTACAACGA GCGCGGCG 


5661 


89 


CGCGCCCG G CCCCCACC 


4674 


GGTGGGGG GGCTAGCTACAACGA CGGGCGCG 


5662 


95 


CGGCCCCC A CCCCUCGC 


4675 


GCGAGGGG GGCTAGCTACAACGA GGGGGCCG 


5663 


102 


CACCCCUC G CAGCACCC 


4676 


GGGTGCTG GGCTAGCTACAACGA GAGGGGTG 


5664 


105 


CCCUCGCA G CACCCCGC 


4677 


GCGGGGTG GGCTAGCTACAACGA TGCGAGGG 


5665 


107 


CUCGCAGC A CCCCGCGC 


4678 


GCGCGGGG GGCTAGCTACAACGA GCTGCGAG 


5666 


112 


AGCACCCC G CGCCCCGC 


4679 


GCGGGGCG GGCTAGCTACAACGA GGGGTGCT 


5667 


114 


CACCCCGC G CCCCGCGC 


4680 


GCGCGGGG GGCTAGCTACAACGA GCGGGGTG 


5668 


119 


CGCGCCCC G CGCCCUCC 


4681 


GGAGGGCG GGCTAGCTACAACGA GGGGCGCG 


5669 


121 


CGCCCCGC G CCCUCCCA 


4682 


TGGGAGGG GGCTAGCTACAACGA GCGGGGCG 


5670 


130 


CCCUCCCA G CCGGGUCC 


4683 


GGACCCGG GGCTAGCTACAACGA TGGGAGGG 


5671 


135 


CCAGCCGG G UCCAGCCG 


4684 


CGGCTGGA GGCTAGCTACAACGA CCGGCTGG 


5672 


140 


CGGGUCCA G CCGGAGCC 


4685 


GGCTCCGG GGCTAGCTACAACGA TGGACCCG 


5673 


146 


CAGCCGGA G CCAUGGGG 


4686 


CCCCATGG GGCTAGCTACAACGA TCCGGCTG 


5674 


149 


CCGGAGCC A UGGGGCCG 


4687 


CGGCCCCA GGCTAGCTACAACGA GGCTCCGG 


5675 


154 


GCCAUGGG G CCGGAGCC 


4688 


GGCTCCGG GGCTAGCTACAACGA CCCATGGC 


5676 


160 


GGGCCGGA G CCGCAGUG 


4689 


CACTGCGG GGCTAGCTACAACGA TCCGGCCC 


5677 


163 


CCGGAGCC G CAGUGAGC 


4690 


GCTCACTG GGCTAGCTACAACGA GGCTCCGG 


5678 


166 


GAGCCGCA G UGAGCACC 


4691 


GGTGCTCA GGCTAGCTACAACGA TGCGGCTC 


5679 


170 


CGCAGUGA G CACCAUGG 


4692 


CCATGGTG GGCTAGCTACAACGA TCACTGCG 


5680 


172 


CAGUGAGC A CCAUGGAG 


4693 


CTCCATGG GGCTAGCTACAACGA GCTCACTG 


5681 


175 


UGAGCACC A UGGAGCUG 


4694 


CAGCTCCA GGCTAGCTACAACGA GGTGCTCA 


5682 


180 


ACCAUGGA G CUGGCGGC 


4695 


GCCGCCAG GGCTAGCTACAACGA TCCATGGT 


5683 


184 


UGGAGCUG G CGGCCUUG 


4696 


CAAGGCCG GGCTAGCTACAACGA CAGCTCCA 


5684 


187 


AGCUGGCG G CCUUGUGC


4697 


GCACAAGG GGCTAGCTACAACGA CGCCAGCT 


5685 


192 


GCGGCCUU G UGCCGCUG 


4698 


CAGCGGCA GGCTAGCTACAACGA AAGGCCGC 


5686 


194 


GGCCUUGU G CCGCUGGG 


4699 


CCCAGCGG GGCTAGCTACAACGA ACAAGGCC 


5687 


197 


CUUGUGCC G CUGGGGGC 


4700 


GCCCCCAG GGCTAGCTACAACGA GGCACAAG 


5688 


204 


CGCUGGGG G CUCCUCCU 


4701 


AGGAGGAG GGCTAGCTACAACGA CCCCAGCG 


5689 


214 


UCCUCCUC G CCCUCUUG 


4702 


CAAGAGGG GGCTAGCTACAACGA GAGGAGGA 


5690


222


GCCCUCUU G CCCCCCGG


4703


CCGGGGGG GGCTAGCTACAACGA AAGAGGGC


5691


232


CCCCCGGA G CCGCGAGC


4704


GCTCGCGG GGCTAGCTACAACGA TCCGGGGG


5692


235


CCGGAGCC G CGAGCACC


4705


GGTGCTCG GGCTAGCTACAACGA GGCTCCGG


5693


239


AGCCGCGA G CACCCAAG


4706


CTTGGGTG GGCTAGCTACAACGA TCGCGGCT


5694


241


CCGCGAGC A CCCAAGUG


4707


CACTTGGG GGCTAGCTACAACGA GCTCGCGG


5695


247


GCACCCAA G UGUGCACC


4708


GGTGCACA GGCTAGCTACAACGA TTGGGTGC


5696


249


ACCCAAGU G UGCACCGG


4709


CCGGTGCA GGCTAGCTACAACGA ACTTGGGT


5697


251


CCAAGUGU G CACCGGCA


4710


TGCCGGTG GGCTAGCTACAACGA ACACTTGG


5698


253


AAGUGUGC A CCGGCACA


4711


TGTGCCGG GGCTAGCTACAACGA GCACACTT


5699


257


GUGCACCG G CACAGACA


4712


TGTCTGTG GGCTAGCTACAACGA CGGTGCAC


5700


259


GCACCGGC A CAGACAUG


4713


CATGTCTG GGCTAGCTACAACGA GCCGGTGC


5701


263


CGGCACAG A CAUGAAGC


4714


GCTTCATG GGCTAGCTACAACGA CTGTGCCG


5702


265


GCACAGAC A UGAAGCUG


4715


CAGCTTCA GGCTAGCTACAACGA GTCTGTGC


5703


270


GACAUGAA G CUGCGGCU


4716


AGCCGCAG GGCTAGCTACAACGA TTCATGTC


5704


273


AUGAAGCU G CGGCUCCC


4717


GGGAGCCG GGCTAGCTACAACGA AGCTTCAT


5705


276


AAGCUGCG G CUCCCUGC


4718


GCAGGGAG GGCTAGCTACAACGA CGCAGCTT


5706


283


GGCUCCCU G CCAGUCCC


4719


GGGACTGG GGCTAGCTACAACGA AGGGAGCC


5707


287


CCCUGCCA G UCCCGAGA


4720


TCTCGGGA GGCTAGCTACAACGA TGGCAGGG


5708


295


GUCCCGAG A CCCACCUG


4721


CAGGTGGG GGCTAGCTACAACGA CTCGGGAC


5709


299


CGAGACCC A CCUGGACA


4722


TGTCCAGG GGCTAGCTACAACGA GGGTCTCG


5710


305


CCACCUGG A CAUGCUCC


4723


GGAGCATG GGCTAGCTACAACGA CCAGGTGG


5711


307


ACCUGGAC A UGCUCCGC


4724


GCGGAGCA GGCTAGCTACAACGA GTCCAGGT


5712


309


CUGGACAU G CUCCGCCA


4725


TGGCGGAG GGCTAGCTACAACGA ATGTCCAG


5713


314


CAUGCUCC G CCACCUCU


4726


AGAGGTGG GGCTAGCTACAACGA GGAGCATG


5714


317


GCUCCGCC A CCUCUACC


4727


GGTAGAGG GGCTAGCTACAACGA GGCGGAGC


5715


323


CCACCUCU A CCAGGGCU


4728


AGCCCTGG GGCTAGCTACAACGA AGAGGTGG


5716


329


CUACCAGG G CUGCCAGG


4729


CCTGGCAG GGCTAGCTACAACGA CCTGGTAG


5717


332


CCAGGGCU G CCAGGUGG


4730


CCACCTGG GGCTAGCTACAACGA AGCCCTGG


5718


337


GCUGCCAG G UGGUGCAG


4731


CTGCACCA GGCTAGCTACAACGA CTGGCAGC


5719


340


GCCAGGUG G UGCAGGGA


4732


TCCCTGCA GGCTAGCTACAACGA CACCTGGC


5720


342


CAGGUGGU G CAGGGAAA


4733


TTTCCCTG GGCTAGCTACAACGA ACCACCTG


5721


350


GCAGGGAA A CCUGGAAC


4734


GTTCCAGG GGCTAGCTACAACGA TTCCCTGC


5722


357


AACCUGGA A CUCACCUA


4735


TAGGTGAG GGCTAGCTACAACGA TCCAGGTT


5723


361


UGGAACUC A CCUACCUG


4736


CAGGTAGG GGCTAGCTACAACGA GAGTTCCA


5724


365


ACUCACCU A CCUGCCCA


4737


TGGGCAGG GGCTAGCTACAACGA AGGTGAGT


5725


369


ACCUACCU G CCCACCAA


4738


TTGGTGGG GGCTAGCTACAACGA AGGTAGGT


5726


373


ACCUGCCC A CCAAUGCC


4739


GGCATTGG GGCTAGCTACAACGA GGGCAGGT


5727


377


GCCCACCA A UGCCAGCC


4740


GGCTGGCA GGCTAGCTACAACGA TGGTGGGC


5728


379


CCACCAAU G CCAGCCUG


4741


CAGGCTGG GGCTAGCTACAACGA ATTGGTGG


5729


383


CAAUGCCA G CCUGUCCU


4742


AGGACAGG GGCTAGCTACAACGA TGGCATTG


5730


387


GCCAGCCU G UCCUUCCU


4743


AGGAAGGA GGCTAGCTACAACGA AGGCTGGC


5731


396


UCCUUCCU G CAGGAUAU


4744


ATATCCTG GGCTAGCTACAACGA AGGAAGGA


5732


401


CCUGCAGG A UAUCCAGG


4745


CCTGGATA GGCTAGCTACAACGA CCTGCAGG


5733


403


UGCAGGAU A UCCAGGAG


4746


CTCCTGGA GGCTAGCTACAACGA ATCCTGCA


5734


412


UCCAGGAG G UGCAGGGC


4747


GCCCTGCA GGCTAGCTACAACGA CTCCTGGA


5735


414


CAGGAGGU G CAGGGCUA


4748


TAGCCCTG GGCTAGCTACAACGA ACCTCCTG


5736


419


GGUGCAGG G CUACGUGC


4749


GCACGTAG GGCTAGCTACAACGA CCTGCACC


5737


422


GCAGGGCU A CGUGCUCA


4750


TGAGCACG GGCTAGCTACAACGA AGCCCTGC


5738


424


AGGGCUAC G UGCUCAUC


4751


GATGAGCA GGCTAGCTACAACGA GTAGCCCT


5739


426


GGCUACGU G CUCAUCGC


4752


GCGATGAG GGCTAGCTACAACGA ACGTAGCC


5740


430


ACGUGCUC A UCGCUCAC


4753


GTGAGCGA GGCTAGCTACAACGA GAGCACGT


5741


433


UGCUCAUC G CUCACAAC


4754


GTTGTGAG GGCTAGCTACAACGA GATGAGCA


5742


437


CAUCGCUC A CAACCAAG 


4755 


CTTGGTTG GGCTAGCTACAACGA GAGCGATG 


5743 


440 


CGCUCACA A CCAAGUGA 


4756 


TCACTTGG GGCTAGCTACAACGA TGTGAGCG 


5744 


445 


ACAACCAA G UGAGGCAG 


4757 


CTGCCTCA GGCTAGCTACAACGA TTGGTTGT 


5745 


450 


CAAGUGAG G CAGGUCCC 


4758 


GGGACCTG GGCTAGCTACAACGA CTCACTTG 


5746 


454 


UGAGGCAG G UCCCACUG 


4759 


CAGTGGGA GGCTAGCTACAACGA CTGCCTCA 


5747 


459 


CAGGUCCC A CUGCAGAG 


4760 


CTCTGCAG GGCTAGCTACAACGA GGGACCTG


5748


462


GUCCCACU G CAGAGGCU


4761


AGCCTCTG GGCTAGCTACAACGA AGTGGGAC


5749


468


CUGCAGAG G CUGCGGAU


4762


ATCCGCAG GGCTAGCTACAACGA CTCTGCAG


5750


471


CAGAGGCU G CGGAUUGU


4763


ACAATCCG GGCTAGCTACAACGA AGCCTCTG


5751


475


GGCUGCGG A UUGUGCGA


4764


TCGCACAA GGCTAGCTACAACGA CCGCAGCC


5752


478


UGCGGAUU G UGCGAGGC


4765


GCCTCGCA GGCTAGCTACAACGA AATCCGCA


5753


480


CGGAUUGU G CGAGGCAC


4766


GTGCCTCG GGCTAGCTACAACGA ACAATCCG


5754


485


UGUGCGAG G CACCCAGC


4767


GCTGGGTG GGCTAGCTACAACGA CTCGCACA


5755


487


UGCGAGGC A CCCAGCUC


4768


GAGCTGGG GGCTAGCTACAACGA GCCTCGCA


5756


492


GGCACCCA G CUCUUUGA


4769


TCAAAGAG GGCTAGCTACAACGA TGGGTGCC


5757


503


CUUUGAGG A CAACUAUG


4770


CATAGTTG GGCTAGCTACAACGA CCTCAAAG


5758


506


UGAGGACA A CUAUGCCC


4771


GGGCATAG GGCTAGCTACAACGA TGTCCTCA


5759


509


GGACAACU A UGCCCUGG


4772


CCAGGGCA GGCTAGCTACAACGA AGTTGTCC


5760


511


ACAACUAU G CCCUGGCC


4773


GGCCAGGG GGCTAGCTACAACGA ATAGTTGT


5761


517


AUGCCCUG G CCGUGCUA


4774


TAGCACGG GGCTAGCTACAACGA CAGGGCAT


5762


520


CCCUGGCC G UGCUAGAC


4775


GTCTAGCA GGCTAGCTACAACGA GGCCAGGG


5763


522


CUGGCCGU G CUAGACAA


4776


TTGTCTAG GGCTAGCTACAACGA ACGGCCAG


5764


527


CGUGCUAG A CAAUGGAG


4777


CTCCATTG GGCTAGCTACAACGA CTAGCACG


5765


530


GCUAGACA A UGGAGACC


4778


GGTCTCCA GGCTAGCTACAACGA TGTCTAGC


5766


536


CAAUGGAG A CCCGCUGA


4779


TCAGCGGG GGCTAGCTACAACGA CTCCATTG


5767


540


GGAGACCC G CUGAACAA


4780


TTGTTCAG GGCTAGCTACAACGA GGGTCTCC


5768


545


CCCGCUGA A CAAUACCA


4781


TGGTATTG GGCTAGCTACAACGA TCAGCGGG


5769


548


GCUGAACA A UACCACCC


4782


GGGTGGTA GGCTAGCTACAACGA TGTTCAGC


5770


550


UGAACAAU A CCACCCCU


4783


AGGGGTGG GGCTAGCTACAACGA ATTGTTCA


5771


553


ACAAUACC A CCCCUGUC


4784


GACAGGGG GGCTAGCTACAACGA GGTATTGT


5772


559


CCACCCCU G UCACAGGG


4785


CCCTGTGA GGCTAGCTACAACGA AGGGGTGG


5773


562


CCCCUGUC A CAGGGGCC


4786


GGCCCCTG GGCTAGCTACAACGA GACAGGGG


5774


568


UCACAGGG G CCUCCCCA


4787


TGGGGAGG GGCTAGCTACAACGA CCCTGTGA


5775


581


CCCAGGAG G CCUGCGGG


4788


CCCGCAGG GGCTAGCTACAACGA CTCCTGGG


5776


585


GGAGGCCU G CGGGAGCU


4789


AGCTCCCG GGCTAGCTACAACGA AGGCCTCC


5777


591


CUGCGGGA G CUGCAGCU


4790


AGCTGCAG GGCTAGCTACAACGA TCCCGCAG


5778


594


CGGGAGCU G CAGCUUCG


4791


CGAAGCTG GGCTAGCTACAACGA AGCTCCCG


5779


597


GAGCUGCA G CUUCGAAG


4792


CTTCGAAG GGCTAGCTACAACGA TGCAGCTC


5780


605


GCUUCGAA G CCUCACAG


4793


CTGTGAGG GGCTAGCTACAACGA TTCGAAGC


5781


610


GAAGCCUC A CAGAGAUC


4794


GATCTCTG GGCTAGCTACAACGA GAGGCTTC


5782


616


UCACAGAG A UCUUGAAA


4795


TTTCAAGA GGCTAGCTACAACGA CTCTGTGA


5783


631


AAGGAGGG G UCUUGAUC


4796


GATCAAGA GGCTAGCTACAACGA CCCTCCTT


5784


637


GGGUCUUG A UCCAGCGG


4797


CCGCTGGA GGCTAGCTACAACGA CAAGACCC


5785


642 


UUGAUCCA G CGGAACCC 


4798 


GGGTTCCG GGCTAGCTACAACGA TGGATCAA 


5786 


647 


CCAGCGGA A CCCCCAGC 


4799 


GCTGGGGG GGCTAGCTACAACGA TCCGCTGG 


5787 


654 


AACCCCCA G CUCUGCUA 


4800 


TAGCAGAG GGCTAGCTACAACGA TGGGGGTT 


5788 


659 


CCAGCUCU G CUACCAGG 


4801 


CCTGGTAG GGCTAGCTACAACGA AGAGCTGG 


5789 


662 


GCUCUGCU A CCAGGACA 


4802 


TGTCCTGG GGCTAGCTACAACGA AGCAGAGC 


5790 


668 


CUACCAGG A CACGAUUU 


4803 


AAATCGTG GGCTAGCTACAACGA CCTGGTAG 


5791 


670 


ACCAGGAC A CGAUUUUG 


4804 


CAAAATCG GGCTAGCTACAACGA GTCCTGGT 


5792 


673 


AGGACACG A UUUUGUGG 


4805 


CCACAAAA GGCTAGCTACAACGA CGTGTCCT 


5793 


678 


ACGAUUUU G UGGAAGGA 


4806 


TCCTTCCA GGCTAGCTACAACGA AAAATCGT 


5794


686


GUGGAAGG A CAUCUUCC


4807


GGAAGATG GGCTAGCTACAACGA CCTTCCAC


5795


688


GGAAGGAC A UCUUCCAC


4808


GTGGAAGA GGCTAGCTACAACGA GTCCTTCC


5796


695


CAUCUUCC A CAAGAACA


4809


TGTTCTTG GGCTAGCTACAACGA GGAAGATG


5797


701


CCACAAGA A CAACCAGC


4810


GCTGGTTG GGCTAGCTACAACGA TCTTGTGG


5798


704


CAAGAACA A CCAGCUGG


4811


CCAGCTGG GGCTAGCTACAACGA TGTTCTTG


5799


708


AACAACCA G CUGGCUCU


4812


AGAGCCAG GGCTAGCTACAACGA TGGTTGTT


5800


712


ACCAGCUG G CUCUCACA


4813


TGTGAGAG GGCTAGCTACAACGA CAGCTGGT


5801


718


UGGCUCUC A CACUGAUA


4814


TATCAGTG GGCTAGCTACAACGA GAGAGCCA


5802


720


GCUCUCAC A CUGAUAGA


4815


TCTATCAG GGCTAGCTACAACGA GTGAGAGC


5803


724


UCACACUG A UAGACACC


4816


GGTGTCTA GGCTAGCTACAACGA CAGTGTGA


5804


728


ACUGAUAG A CACCAACC


4817


GGTTGGTG GGCTAGCTACAACGA CTATCAGT


5805


730


UGAUAGAC A CCAACCGC


4818


GCGGTTGG GGCTAGCTACAACGA GTCTATCA


5806


734


AGACACCA A CCGCUCUC


4819


GAGAGCGG GGCTAGCTACAACGA TGGTGTCT


5807


737


CACCAACC G CUCUCGGG


4820


CCCGAGAG GGCTAGCTACAACGA GGTTGGTG


5808


745


GCUCUCGG G CCUGCCAC


4821


GTGGCAGG GGCTAGCTACAACGA CCGAGAGC


5809


749


UCGGGCCU G CCACCCCU


4822


AGGGGTGG GGCTAGCTACAACGA AGGCCCGA


5810


752


GGCCUGCC A CCCCUGUU


4823


AACAGGGG GGCTAGCTACAACGA GGCAGGCC


5811


758


CCACCCCU G UUCUCCGA


4824


TCGGAGAA GGCTAGCTACAACGA AGGGGTGG


5812


766


GUUCUCCG A UGUGUAAG


4825


CTTACACA GGCTAGCTACAACGA CGGAGAAC


5813


768

CTGGCCGG GGCTAGCTACAACGA X\sbCAb 


6034 


1671


GCCAACCG G CCAGAGGA


5047


TCCTCTGG GGCTAGCTACAACGA CGGTTGGC


6035


1679


GCCAGAGG A CGAGUGUG


5048


CACACTCG GGCTAGCTACAACGA CCTCTGGC


6036


1683


GAGGACGA G UGUGUGGG


5049


CCCACACA GGCTAGCTACAACGA TCGTCCTC


6037


1685


GGACGAGU G UGUGGGCG


5050


CGCCCACA GGCTAGCTACAACGA ACTCGTCC


6038


1687


ACGAGUGU G UGGGCGAG


5051


CTCGCCCA GGCTAGCTACAACGA ACACTCGT


6039


1691


GUGUGUGG G CGAGGGCC


5052


GGCCCTCG GGCTAGCTACAACGA CCACACAC


6040


1697


GGGCGAGG G CCUGGCCU


5053


AGGCCAGG GGCTAGCTACAACGA CCTCGCCC


6041


1702


AGGGCCUG G CCUGCCAC


5054


GTGGCAGG GGCTAGCTACAACGA CAGGCCCT


6042


1706


CCUGGCCU G CCACCAGC


5055


GCTGGTGG GGCTAGCTACAACGA AGGCCAGG


6043


1709


GGCCUGCC A CCAGCUGU


5056


ACAGCTGG GGCTAGCTACAACGA GGCAGGCC


6044


1713


UGCCACCA G CUGUGCGC


5057


GCGCACAG GGCTAGCTACAACGA TGGTGGCA


6045


1716


CACCAGCU G UGCGCCCG


5058


CGGGCGCA GGCTAGCTACAACGA AGCTGGTG


6046


1718 


CCAGCUGU G CGCCCGAG 


5059 


CTCGGGCG GGCTAGCTACAACGA ACAGCTGG 


6047 


1720 


AGCUGUGC G CCCGAGGG 


5060 


CCCTCGGG GGCTAGCTACAACGA GCACAGCT 


6048 


1728 


GCCCGAGG G CACUGCUG 


5061 


CAGCAGTG GGCTAGCTACAACGA CCTCGGGC 


6049 


1730 


CCGAGGGC A CUGCUGGG 


5062 


CCCAGCAG GGCTAGCTACAACGA GCCCTCGG 


6050 


1733 


AGGGCACU G CUGGGGUC 


5063 


GACCCCAG GGCTAGCTACAACGA AGTGCCCT 


6051 


1739 


CUGCUGGG G UCCAGGGC 


5064 


GCCCTGGA GGCTAGCTACAACGA CCCAGCAG 


6052 


1746 


GGUCCAGG G CCCACCCA 


5065 


TGGGTGGG GGCTAGCTACAACGA CCTGGACC 


6053 


1750 


CAGGGCCC A CCCAGUGU 


5066 


ACACTGGG GGCTAGCTACAACGA GGGCCCTG 


6054


1755


CCCACCCA G UGUGUCAA 


5067 


TTGACACA GGCTAGCTACAACGA TGGGTGGG 


6055 


1757 


CACCCAGU G UGUCAACU 


5068 


AGTTGACA GGCTAGCTACAACGA ACTGGGTG 


6056 


1759 


CCCAGUGU G UCAACUGC 


5069 


GCAGTTGA GGCTAGCTACAACGA ACACTGGG 


6057 


1763 


GUGUGUCA A CUGCAGCC 


5070 


GGCTGCAG GGCTAGCTACAACGA TGACACAC 


6058


1766 


UGUCAACU G CAGCCAGU 


5071 


ACTGGCTG GGCTAGCTACAACGA AGTTGACA 


6059 


1769 


CAACUGCA G CCAGUUCC 


5072 


GGAACTGG GGCTAGCTACAACGA TGCAGTTG 


6060 


1773 


UGCAGCCA G UUCCUUCG 


5073 


CGAAGGAA GGCTAGCTACAACGA TGGCTGCA


6061 


1784 


CCUUCGGG G CCAGGAGU 


5074 


ACTCCTGG GGCTAGCTACAACGA CCCGAAGG 


6062 


1791 


GGCCAGGA G UGCGUGGA 


5075 


TCCACGCA GGCTAGCTACAACGA TCCTGGCC 


6063 


1793 


CCAGGAGU G CGUGGAGG 


5076 


CCTCCACG GGCTAGCTACAACGA ACTCCTGG 


6064 


1795 


AGGAGUGC G UGGAGGAA


5077 


TTCCTCCA GGCTAGCTACAACGA GCACTCCT 


6065 


1803 


GUGGAGGA A UGCCGAGU 


5078 


ACTCGGCA GGCTAGCTACAACGA TCCTCCAC


6066 


1805 


GGAGGAAU G CCGAGUAC 


5079 


GTACTCGG GGCTAGCTACAACGA ATTCCTCC 


6067 


1810 


AAUGCCGA G UACUGCAG 


5080 


CTGCAGTA GGCTAGCTACAACGA TCGGCATT 


6068 


1812 


UGCCGAGU A CUGCAGGG 


5081 


CCCTGCAG GGCTAGCTACAACGA ACTCGGCA 


6069 


1815 


CGAGUACU G CAGGGGCU 


5082 


AGCCCCTG GGCTAGCTACAACGA AGTACTCG 


6070 


1821 


CUGCAGGG G CUCCCCAG 


5083 


CTGGGGAG GGCTAGCTACAACGA CCCTGCAG


6071 


1833 


CCCAGGGA G UAUGUGAA 


5084 


TTCACATA GGCTAGCTACAACGA TCCCTGGG 


6072 


1835 


CAGGGAGU A UGUGAAUG 


5085 


CATTCACA GGCTAGCTACAACGA ACTCCCTG


6073 


1837 


GGGAGUAU G UGAAUGCC


5086 


GGCATTCA GGCTAGCTACAACGA ATACTCCC 


6074 


1841 


GUAUGUGA A UGCCAGGC 


5087 


GCCTGGCA GGCTAGCTACAACGA TCACATAC 


6075 


1843 


AUGUGAAU G CCAGGCAC


5088 


GTGCCTGG GGCTAGCTACAACGA ATTCACAT 


6076 


1848 


AAUGCCAG G CACUGUUU


5089 


AAACAGTG GGCTAGCTACAACGA CTGGCATT 


6077 


1850 


UGCCAGGC A CUGUUUGC 


5090 


GCAAACAG GGCTAGCTACAACGA GCCTGGCA 


6078 


1853 


CAGGCACU G UUUGCCGU 


5091 


ACGGCAAA GGCTAGCTACAACGA AGTGCCTG 


6079 


1857 


CACUGUUU G CCGUGCCA 


5092 


TGGCACGG GGCTAGCTACAACGA AAACAGTG 


6080 


1860 


UGUUUGCC G UGCCACCC 


5093 


GGGTGGCA GGCTAGCTACAACGA GGCAAACA 


6081 


1862 


UUUGCCGU G CCACCCUG 


5094 


CAGGGTGG GGCTAGCTACAACGA ACGGCAAA 


6082 


1865 


GCCGUGCC A CCCUGAGU


5095 


ACTCAGGG GGCTAGCTACAACGA GGCACGGC 


6083 


1872 


CACCCUGA G UGUCAGCC 


5096 


GGCTGACA GGCTAGCTACAACGA TCAGGGTG 


6084 


1874 


CCCUGAGU G UCAGCCCC 


5097 


GGGGCTGA GGCTAGCTACAACGA ACTCAGGG 


6085 


1878 


GAGUGUCA G CCCCAGAA 


5098 


TTCTGGGG GGCTAGCTACAACGA TGACACTC 


6086 


1886 


GCCCCAGA A UGGCUCAG 


5099 


CTGAGCCA GGCTAGCTACAACGA TCTGGGGC 


6087 


1889 


CCAGAAUG G CUCAGUGA 


5100 


TCACTGAG GGCTAGCTACAACGA CATTCTGG 


6088 


1894


AUGGCUCA G UGACCUGU 


5101 


ACAGGTCA GGCTAGCTACAACGA TGAGCCAT 


6089 


1897 


GCUCAGUG A CCUGUUUU 


5102 


AAAACAGG GGCTAGCTACAACGA CACTGAGC


6090 


1901 


AGUGACCU G UUUUGGAC 


5103


GTCCAAAA GGCTAGCTACAACGA AGGTCACT 


6091 


1908 


UGUUUUGG A CCGGAGGC 


5104 


GCCTCCGG GGCTAGCTACAACGA CCAAAACA


6092 


1915 


GACCGGAG G CUGACCAG 


5105 


CTGGTCAG GGCTAGCTACAACGA CTCCGGTC 


6093 


1919 


GGAGGCUG A CCAGUGUG 


5106 


CACACTGG GGCTAGCTACAACGA CAGCCTCC 


6094 


1923 


GCUGACCA G UGUGUGGC 


5107 


GCCACACA GGCTAGCTACAACGA TGGTCAGC 


6095 


1925 


UGACCAGU G UGUGGCCU 


5108 


AGGCCACA GGCTAGCTACAACGA ACTGGTCA 


6096 


1927 


ACCAGUGU G UGGCCUGU 


5109 


ACAGGCCA GGCTAGCTACAACGA ACACTGGT 


6097 


1930 


AGUGUGUG G CCUGUGCC


5110


GGCACAGG GGCTAGCTACAACGA CACACACT


6098


1934 


UGUGGCCU G UGCCCACU 


5111 


AGTGGGCA GGCTAGCTACAACGA AGGCCACA 


6099 


1936 


UGGCCUGU G CCCACUAU 


5112 


ATAGTGGG GGCTAGCTACAACGA ACAGGCCA 


6100 


1940 


CUGUGCCC A CUAUAAGG 


5113 


CCTTATAG GGCTAGCTACAACGA GGGCACAG 


6101 


1943 


UGCCCACU A UAAGGACC 


5114 


GGTCCTTA GGCTAGCTACAACGA AGTGGGCA 


6102 


1949 


CUAUAAGG A CCCUCCCU 


5115 


AGGGAGGG GGCTAGCTACAACGA CCTTATAG 


6103 


1961 


UCCCUUCU G CGUGGCCC 


5116 


GGGCCACG GGCTAGCTACAACGA AGAAGGGA 


6104 


1963 


CCUUCUGC G UGGCCCGC 


5117 


GCGGGCCA GGCTAGCTACAACGA GCAGAAGG 


6105 


1966 


UCUGCGUG G CCCGCUGC 


5118 


GCAGCGGG GGCTAGCTACAACGA CACGCAGA 


6106


1970


CGUGGCCC G CUGCCCCA 


5119 


TGGGGCAG GGCTAGCTACAACGA GGGCCACG 


6107 


1973 


GGCCCGCU G CCCCAGCG 


5120 


CGCTGGGG GGCTAGCTACAACGA AGCGGGCC 


6108 


1979 


CUGCCCCA G CGGUGUGA 


5121 


TCACACCG GGCTAGCTACAACGA TGGGGCAG 


6109 


1982 


CCCCAGCG G UGUGAAAC 


5122 


GTTTCACA GGCTAGCTACAACGA CGCTGGGG 


6110 


1984 


CCAGCGGU G UGAAACCU 


5123 


AGGTTTCA GGCTAGCTACAACGA ACCGCTGG 


6111 


1989 


GGUGUGAA A CCUGACCU 


5124 


AGGTCAGG GGCTAGCTACAACGA TTCACACC 


6112 


1994 


GAAACCUG A CCUCUCCU 


5125 


AGGAGAGG GGCTAGCTACAACGA CAGGTTTC 


6113 


2003 


CCUCUCCU A CAUGCCCA 


5126 


TGGGCATG GGCTAGCTACAACGA AGGAGAGG 


6114 


2005 


UCUCCUAC A UGCCCAUC 


5127 


GATGGGCA GGCTAGCTACAACGA GTAGGAGA 


6115 


2007 


UCCUACAU G CCCAUCUG 


5128 


CAGATGGG GGCTAGCTACAACGA ATGTAGGA 


6116 


2011 


ACAUGCCC A UCUGGAAG 


5129 


CTTCCAGA GGCTAGCTACAACGA GGGCATGT 


6117 


2019 


AUCUGGAA G UUUCCAGA 


5130 


TCTGGAAA GGCTAGCTACAACGA TTCCAGAT 


6118 


2027 


GUUUCCAG A UGAGGAGG 


5131 


CCTCCTCA GGCTAGCTACAACGA CTGGAAAC 


6119 


2036 


UGAGGAGG G CGCAUGCC 


5132 


GGCATGCG GGCTAGCTACAACGA CCTCCTCA 


6120 


2038 


AGGAGGGC G CAUGCCAG 


5133 


CTGGCATG GGCTAGCTACAACGA GCCCTCCT 


6121 


2040 


GAGGGCGC A UGCCAGCC 


5134 


GGCTGGCA GGCTAGCTACAACGA GCGCCCTC 


6122 


2042 


GGGCGCAU G CCAGCCUU 


5135 


AAGGCTGG GGCTAGCTACAACGA ATGCGCCC 


6123 


2046 


GCAUGCCA G CCUUGCCC 


5136 


GGGCAAGG GGCTAGCTACAACGA TGGCATGC 


6124 


2051 


CCAGCCUU G CCCCAUCA 


5137 


TGATGGGG GGCTAGCTACAACGA AAGGCTGG 


6125 


2056 


CUUGCCCC A UCAACUGC 


5138 


GCAGTTGA GGCTAGCTACAACGA GGGGCAAG 


6126 


2060 


CCCCAUCA A CUGCACCC 


5139 


GGGTGCAG GGCTAGCTACAACGA TGATGGGG 


6127 


2063 


CAUCAACU G CACCCACU 


5140 


AGTGGGTG GGCTAGCTACAACGA AGTTGATG 


6128 


2065 


UCAACUGC A CCCACUCC 


5141 


GGAGTGGG GGCTAGCTACAACGA GCAGTTGA 


6129 


2069 


CUGCACCC A CUCCUGUG 


5142 


CACAGGAG GGCTAGCTACAACGA GGGTGCAG 


6130 


2075 


CCACUCCU G UGUGGACC 


5143 


GGTCCACA GGCTAGCTACAACGA AGGAGTGG 


6131 


2077 


ACUCCUGU G UGGACCUG 


5144 


CAGGTCCA GGCTAGCTACAACGA ACAGGAGT 


6132 


2081 


CUGUGUGG A CCUGGAUG 


5145 


CATCCAGG GGCTAGCTACAACGA CCACACAG 


6133 


2087 


GGACCUGG A UGACAAGG 


5146 


CCTTGTCA GGCTAGCTACAACGA CCAGGTCC 


6134 


2090 


CCUGGAUG A CAAGGGCU 


5147 


AGCCCTTG GGCTAGCTACAACGA CATCCAGG 


6135 


2096 


UGACAAGG G CUGCCCCG 


5148 


CGGGGCAG GGCTAGCTACAACGA CCTTGTCA 


6136 


2099 


CAAGGGCU G CCCCGCCG 


5149 


CGGCGGGG GGCTAGCTACAACGA AGCCCTTG 


6137 


2104 


GCUGCCCC G CCGAGCAG 


5150 


CTGCTCGG GGCTAGCTACAACGA GGGGCAGC 


6138 


2109 


CCCGCCGA G CAGAGAGC 


5151 


GCTCTCTG GGCTAGCTACAACGA TCGGCGGG 


6139 


2116 


AGCAGAGA G CCAGCCCU 


5152 


AGGGCTGG GGCTAGCTACAACGA TCTCTGCT 


6140 


2120 


GAGAGCCA G CCCUCUGA 


5153 


TCAGAGGG GGCTAGCTACAACGA TGGCTCTC 


6141 


2128 


GCCCUCUG A CGUCCAUC 


5154 


GATGGACG GGCTAGCTACAACGA CAGAGGGC 


6142 


2130 


CCUCUGAC G UCCAUCAU 


5155 


ATGATGGA GGCTAGCTACAACGA GTCAGAGG 


6143 


2134 


UGACGUCC A UCAUCUCU 


5156 


AGAGATGA GGCTAGCTACAACGA GGACGTCA 


6144 


2137 


CGUCCAUC A UCUCUGCG 


5157 


CGCAGAGA GGCTAGCTACAACGA GATGGACG 


6145 


2143 


UCAUCUCU G CGGUGGUU 


5158 


AACCACCG GGCTAGCTACAACGA AGAGATGA 


6146 


2146 


UCUCUGCG G UGGUUGGC 


5159 


GCCAACCA GGCTAGCTACAACGA CGCAGAGA 


6147 


2149 


CUGCGGUG G UUGGCAUU 


5160 


AATGCCAA GGCTAGCTACAACGA CACCGCAG 


6148 


2153 


GGUGGUUG G CAUUCUGC 


5161


GCAGAATG GGCTAGCTACAACGA CAACCACC 


6149 


2155


UGGUUGGC A UUCUGCUG 


5162 


CAGCAGAA GGCTAGCTACAACGA GCCAACCA


6150 


2160 


GGCAUUCU G CUGGUCGU 


5163 


ACGACCAG GGCTAGCTACAACGA AGAATGCC


6151 


2164 


UUCUGCUG G UCGUGGUC 


5164 


GACCACGA GGCTAGCTACAACGA CAGCAGAA 


6152 


2167 


UGCUGGUC G UGGUCUUG 


5165 


CAAGACCA GGCTAGCTACAACGA GACCAGCA 


6153 


2170 


UGGUCGUG G UCUUGGGG 


5166 


CCCCAAGA GGCTAGCTACAACGA CACGACCA 


6154 


2179 


UCUUGGGG G UGGUCUUU 


5167 


AAAGACCA GGCTAGCTACAACGA CCCCAAGA 


6155 


2182 


UGGGGGUG G UCUUUGGG 


5168 


CCCAAAGA GGCTAGCTACAACGA CACCCCCA 


6156 


2191 


UCUUUGGG A UCCUCAUC 


5169 


GATGAGGA GGCTAGCTACAACGA CCCAAAGA 


6157 


2197 


GGAUCCUC A UCAAGCGA 


5170 


TCGCTTGA GGCTAGCTACAACGA GAGGATCC


6158


2202


CUCAUCAA G CGACGGCA


5171


TGCCGTCG GGCTAGCTACAACGA TTGATGAG


6159


2205


AUCAAGCG A CGGCAGCA


5172


TGCTGCCG GGCTAGCTACAACGA CGCTTGAT


6160


2208


AAGCGACG G CAGCAGAA


5173


TTCTGCTG GGCTAGCTACAACGA CGTCGCTT


6161


2211


CGACGGCA G CAGAAGAU


5174


ATCTTCTG GGCTAGCTACAACGA TGCCGTCG


6162


2218


AGCAGAAG A UCCGGAAG


5175


CTTCCGGA GGCTAGCTACAACGA CTTCTGCT


6163


2226


AUCCGGAA G UACACGAU


5176


ATCGTGTA GGCTAGCTACAACGA TTCCGGAT


6164


2228


CCGGAAGU A CACGAUGC


5177


GCATCGTG GGCTAGCTACAACGA ACTTCCGG


6165


2230


GGAAGUAC A CGAUGCGG


5178


CCGCATCG GGCTAGCTACAACGA GTACTTCC


6166


2233


AGUACACG A UGCGGAGA


5179


TCTCCGCA GGCTAGCTACAACGA CGTGTACT


6167


2235


UACACGAU G CGGAGACU


5180


AGTCTCCG GGCTAGCTACAACGA ATCGTGTA


6168


2241


AUGCGGAG A CUGCUGCA


5181


TGCAGCAG GGCTAGCTACAACGA CTCCGCAT


6169


2244


CGGAGACU G CUGCAGGA


5182


TCCTGCAG GGCTAGCTACAACGA AGTCTCCG


6170


2247


AGACUGCU G CAGGAAAC


5183


GTTTCCTG GGCTAGCTACAACGA AGCAGTCT


6171


2254


UGCAGGAA A CGGAGCUG


5184


CAGCTCCG GGCTAGCTACAACGA TTCCTGCA


6172


2259


GAAACGGA G CUGGUGGA


5185


TCCACCAG GGCTAGCTACAACGA TCCGTTTC


6173


2263


CGGAGCUG G UGGAGCCG


5186


CGGCTCCA GGCTAGCTACAACGA CAGCTCCG


6174


2268


CUGGUGGA G CCGCUGAC


5187


GTCAGCGG GGCTAGCTACAACGA TCCACCAG


6175


2271


GUGGAGCC G CUGACACC


5188


GGTGTCAG GGCTAGCTACAACGA GGCTCCAC


6176


2275


AGCCGCUG A CACCUAGC


5189


GCTAGGTG GGCTAGCTACAACGA CAGCGGCT


6177


2277


CCGCUGAC A CCUAGCGG


5190


CCGCTAGG GGCTAGCTACAACGA GTCAGCGG


6178


2282


GACACCUA G CGGAGCGA


5191


TCGCTCCG GGCTAGCTACAACGA TAGGTGTC


6179


2287


CUAGCGGA G CGAUGCCC


5192


GGGCATCG GGCTAGCTACAACGA TCCGCTAG


6180


2290


GCGGAGCG A UGCCCAAC


5193


GTTGGGCA GGCTAGCTACAACGA CGCTCCGC


6181


2292


GGAGCGAU G CCCAACCA


5194


TGGTTGGG GGCTAGCTACAACGA ATCGCTCC


6182


2297


GAUGCCCA A CCAGGCGC


5195


GCGCCTGG GGCTAGCTACAACGA TGGGCATC


6183


2302


CCAACCAG G CGCAGAUG


5196


CATCTGCG GGCTAGCTACAACGA CTGGTTGG


6184


2304


AACCAGGC G CAGAUGCG


5197


CGCATCTG GGCTAGCTACAACGA GCCTGGTT


6185


2308


AGGCGCAG A UGCGGAUC


5198


GATCCGCA GGCTAGCTACAACGA CTGCGCCT


6186


2310


GCGCAGAU G CGGAUCCU


5199


AGGATCCG GGCTAGCTACAACGA ATCTGCGC


6187


2314


AGAUGCGG A UCCUGAAA


5200


TTTCAGGA GGCTAGCTACAACGA CCGCATCT


6188


2326


UGAAAGAG A CGGAGCUG


5201


CAGCTCCG GGCTAGCTACAACGA CTCTTTCA


6189


2331


GAGACGGA G CUGAGGAA


5202


TTCCTCAG GGCTAGCTACAACGA TCCGTCTC


6190


2341


UGAGGAAG G UGAAGGUG


5203


CACCTTCA GGCTAGCTACAACGA CTTCCTCA


6191


2347


AGGUGAAG G UGCUUGGA


5204


TCCAAGCA GGCTAGCTACAACGA CTTCACCT


6192


2349


GUGAAGGU G CUUGGAUC


5205


GATCCAAG GGCTAGCTACAACGA ACCTTCAC


6193


2355


GUGCUUGG A UCUGGCGC


5206


GCGCCAGA GGCTAGCTACAACGA CCAAGCAC


6194


2360


UGGAUCUG G CGCUUUUG


5207


CAAAAGCG GGCTAGCTACAACGA CAGATCCA


6195


2362


GAUCUGGC G CUUUUGGC


5208


GCCAAAAG GGCTAGCTACAACGA GCCAGATC


6196


2369


CGCUUUUG G CACAGUCU


5209


AGACTGTG GGCTAGCTACAACGA CAAAAGCG


6197


2371


CUUUUGGC A CAGUCUAC


5210


GTAGACTG GGCTAGCTACAACGA GCCAAAAG


6198


2374


UUGGCACA G UCUACAAG


5211


CTTGTAGA GGCTAGCTACAACGA TGTGCCAA


6199


2378


CACAGUCU A CAAGGGCA


5212


TGCCCTTG GGCTAGCTACAACGA AGACTGTG


6200


2809


AGUACCAU G CAGAUGGG


5315


CCCATCTG GGCTAGCTACAACGA ATGGTACT


6303


2813


CCAUGCAG A UGGGGGCA


5316


TGCCCCCA GGCTAGCTACAACGA CTGCATGG


6304


2819


AGAUGGGG G CAAGGUGC


5317


GCACCTTG GGCTAGCTACAACGA CCCCATCT


6305


2824


GGGGCAAG G UGCCCAUC


5318 


GATGGGCA GGCTAGCTACAACGA CTTGCCCC 


6306


2826 


GGCAAGGU G CCCAUCAA 


5319 


TTGATGGG GGCTAGCTACAACGA ACCTTGCC 


6307 


2830 


AGGUGCCC A UCAAGUGG 


5320 


CCACTTGA GGCTAGCTACAACGA GGGCACCT 


6308 


2835 


CCCAUCAA G UGGAUGGC 


5321 


GCCATCCA GGCTAGCTACAACGA TTGATGGG 


6309 


2839 


UCAAGUGG A UGGCGCUG 


5322 


CAGCGCCA GGCTAGCTACAACGA CCACTTGA 


6310 


2842 


AGUGGAUG G CGCUGGAG 


5323 


CTCCAGCG GGCTAGCTACAACGA CATCCACT


6311 


2844 


UGGAUGGC G CUGGAGUC 


5324 


GACTCCAG GGCTAGCTACAACGA GCCATCCA 


6312 


2850 


GCGCUGGA G UCCAUUCU 


5325 


AGAATGGA GGCTAGCTACAACGA TCCAGCGC 


6313 


2854 


UGGAGUCC A UUCUCCGC 


5326 


GCGGAGAA GGCTAGCTACAACGA GGACTCCA 


6314


2861


CAUUCUCC G CCGGCGGU


5327


ACCGCCGG GGCTAGCTACAACGA GGAGAATG


6315


2865


CUCCGCCG G CGGUUCAC


5328


GTGAACCG GGCTAGCTACAACGA CGGCGGAG


6316


2868


CGCCGGCG G UUCACCCA


5329


TGGGTGAA GGCTAGCTACAACGA CGCCGGCG


6317


2872


GGCGGUUC A CCCACCAG


5330


CTGGTGGG GGCTAGCTACAACGA GAACCGCC


6318


2876


GUUCACCC A CCAGAGUG


5331


CACTCTGG GGCTAGCTACAACGA GGGTGAAC


6319


2882


CCACCAGA G UGAUGUGU


5332


ACACATCA GGCTAGCTACAACGA TCTGGTGG


6320


2885


CCAGAGUG A UGUGUGGA


5333


TCCACACA GGCTAGCTACAACGA CACTCTGG


6321


2887


AGAGUGAU G UGUGGAGU


5334


ACTCCACA GGCTAGCTACAACGA ATCACTCT


6322


2889


AGUGAUGU G UGGAGUUA


5335


TAACTCCA GGCTAGCTACAACGA ACATCACT


6323


2894


UGUGUGGA G UUAUGGUG


5336


CACCATAA GGCTAGCTACAACGA TCCACACA


6324


2897


GUGGAGUU A UGGUGUGA


5337


TCACACCA GGCTAGCTACAACGA AACTCCAC


6325


2900


GAGUUAUG G UGUGACUG


5338


CAGTCACA GGCTAGCTACAACGA CATAACTC


6326


2902


GUUAUGGU G UGACUGUG


5339


CACAGTCA GGCTAGCTACAACGA ACCATAAC


6327


2905


AUGGUGUG A CUGUGUGG


5340


CCACACAG GGCTAGCTACAACGA CACACCAT


6328


2908


GUGUGACU G UGUGGGAG


5341


CTCCCACA GGCTAGCTACAACGA AGTCACAC


6329


2910


GUGACUGU G UGGGAGCU


5342


AGCTCCCA GGCTAGCTACAACGA ACAGTCAC


6330


2916


GUGUGGGA G CUGAUGAC


5343


GTCATCAG GGCTAGCTACAACGA TCCCACAC


6331


2920


GGGAGCUG A UGACUUUU


5344


AAAAGTCA GGCTAGCTACAACGA CAGCTCCC


6332


2923


AGCUGAUG A CUUUUGGG


5345


CCCAAAAG GGCTAGCTACAACGA CATCAGCT


6333


2932


CUUUUGGG G CCAAACCU


5346


AGGTTTGG GGCTAGCTACAACGA CCCAAAAG


6334


2937


GGGGCCAA A CCUUACGA


5347


TCGTAAGG GGCTAGCTACAACGA TTGGCCCC


6335


2942


CAAACCUU A CGAUGGGA


5348


TCCCATCG GGCTAGCTACAACGA AAGGTTTG


6336


2945


ACCUUACG A UGGGAUCC


5349


GGATCCCA GGCTAGCTACAACGA CGTAAGGT


6337


2950


ACGAUGGG A UCCCAGCC


5350


GGCTGGGA GGCTAGCTACAACGA CCCATCGT


6338


2956


GGAUCCCA G CCCGGGAG


5351


CTCCCGGG GGCTAGCTACAACGA TGGGATCC


6339


2965


CCCGGGAG A UCCCUGAC


5352


GTCAGGGA GGCTAGCTACAACGA CTCCCGGG


6340


2972


GAUCCCUG A CCUGCUGG


5353


CCAGCAGG GGCTAGCTACAACGA CAGGGATC


6341


2976


CCUGACCU G CUGGAAAA


5354


TTTTCCAG GGCTAGCTACAACGA AGGTCAGG


6342


2991


AAGGGGGA G CGGCUGCC


5355


GGCAGCCG GGCTAGCTACAACGA TCCCCCTT


6343


2994


GGGGAGCG G CUGCCCCA


5356


TGGGGCAG GGCTAGCTACAACGA CGCTCCCC


6344


2997


GAGCGGCU G CCCCAGCC


5357


GGCTGGGG GGCTAGCTACAACGA AGCCGCTC


6345


3003


CUGCCCCA G CCCCCCAU


5358


ATGGGGGG GGCTAGCTACAACGA TGGGGCAG


6346


3010


AGCCCCCC A UCUGCACC


5359


GGTGCAGA GGCTAGCTACAACGA GGGGGGCT


6347


3014


CCCCAUCU G CACCAUUG


5360


CAATGGTG GGCTAGCTACAACGA AGATGGGG


6348


3016


CCAUCUGC A CCAUUGAU


5361


ATCAATGG GGCTAGCTACAACGA GCAGATGG


6349


3019


UCUGCACC A UUGAUGUC


5362


GACATCAA GGCTAGCTACAACGA GGTGCAGA


6350


3023


CACCAUUG A UGUCUACA


5363


TGTAGACA GGCTAGCTACAACGA CAATGGTG


6351


3025


CCAUUGAU G UCUACAUG


5364


CATGTAGA GGCTAGCTACAACGA ATCAATGG


6352


3029


UGAUGUCU A CAUGAUCA


5365


TGATCATG GGCTAGCTACAACGA AGACATCA


6353


3031


AUGUCUAC A UGAUCAUG


5366


CATGATCA GGCTAGCTACAACGA GTAGACAT


6354


3034


UCUACAUG A UCAUGGUC


5367


GACCATGA GGCTAGCTACAACGA CATGTAGA


6355


3037


ACAUGAUC A UGGUCAAA


5368


TTTGACCA GGCTAGCTACAACGA GATCATGT


6356


3040


UGAUCAUG G UCAAAUGU


5369


ACATTTGA GGCTAGCTACAACGA CATGATCA


6357


3045


AUGGUCAA A UGUUGGAU


5370


ATCCAACA GGCTAGCTACAACGA TTGACCAT


6358 


3047 


GGUCAAAU G UUGGAUGA 


5371 


TCATCCAA GGCTAGCTACAACGA ATTTGACC 


6359 


3052 


AAUGUUGG A UGAUUGAC 


5372 


GTCAATCA GGCTAGCTACAACGA CCAACATT 


6360 


3055 


GUUGGAUG A UUGACUCU 


5373 


AGAGTCAA GGCTAGCTACAACGA CATCCAAC 


6361 


3059 


GAUGAUUG A CUCUGAAU 


5374 


ATTCAGAG GGCTAGCTACAACGA CAATCATC 


6362 


3066 


GACUCUGA A UGUCGGCC 


5375 


GGCCGACA GGCTAGCTACAACGA TCAGAGTC 


6363 


3068 


CUCUGAAU G UCGGCCAA 


5376 


TTGGCCGA GGCTAGCTACAACGA ATTCAGAG 


6364 


3072 


GAAUGUCG G CCAAGAUU 


5377 


AATCTTGG GGCTAGCTACAACGA CGACATTC 


6365 


3078 


CGGCCAAG A UUCCGGGA 


5378 


TCCCGGAA GGCTAGCTACAACGA CTTGGCCG 


6366


3087


UUCCGGGA G UUGGUGUC


5379


GACACCAA GGCTAGCTACAACGA TCCCGGAA


6367


3091


GGGAGUUG G UGUCUGAA


5380


TTCAGACA GGCTAGCTACAACGA CAACTCCC


6368


3093


GAGUUGGU G UCUGAAUU


5381


AATTCAGA GGCTAGCTACAACGA ACCAACTC


6369


3099


GUGUCUGA A UUCUCCCG


5382


CGGGAGAA GGCTAGCTACAACGA TCAGACAC


6370


3107


AUUCUCCC G CAUGGCCA


5383


TGGCCATG GGCTAGCTACAACGA GGGAGAAT


6371


3109


UCUCCCGC A UGGCCAGG


5384


CCTGGCCA GGCTAGCTACAACGA GCGGGAGA


6372


3112


CCCGCAUG G CCAGGGAC


5385


GTCCCTGG GGCTAGCTACAACGA CATGCGGG


6373


3119


GGCCAGGG A CCCCCAGC


5386


GCTGGGGG GGCTAGCTACAACGA CCCTGGCC


6374


3126


GACCCCCA G CGCUUUGU


5387


ACAAAGCG GGCTAGCTACAACGA TGGGGGTC


6375


3128


CCCCCAGC G CUUUGUGG


5388


CCACAAAG GGCTAGCTACAACGA GCTGGGGG


6376


3133


AGCGCUUU G UGGUCAUC


5389


GATGACCA GGCTAGCTACAACGA AAAGCGCT


6377


3136


GCUUUGUG G UCAUCCAG


5390


CTGGATGA GGCTAGCTACAACGA CACAAAGC


6378


3139


UUGUGGUC A UCCAGAAU


5391


ATTCTGGA GGCTAGCTACAACGA GACCACAA


6379


3146


CAUCCAGA A UGAGGACU


5392


AGTCCTCA GGCTAGCTACAACGA TCTGGATG


6380


3152


GAAUGAGG A CUUGGGCC


5393


GGCCCAAG GGCTAGCTACAACGA CCTCATTC


6381


3158


GGACUUGG G CCCAGCCA


5394


TGGCTGGG GGCTAGCTACAACGA CCAAGTCC


6382


3163


UGGGCCCA G CCAGUCCC


5395


GGGACTGG GGCTAGCTACAACGA TGGGCCCA


6383


3167


CCCAGCCA G UCCCUUGG


5396


CCAAGGGA GGCTAGCTACAACGA TGGCTGGG


6384


3176


UCCCUUGG A CAGCACCU


5397


AGGTGCTG GGCTAGCTACAACGA CCAAGGGA


6385


3179


CUUGGACA G CACCUUCU


5398


AGAAGGTG GGCTAGCTACAACGA TGTCCAAG


6386


3181


UGGACAGC A CCUUCUAC


5399


GTAGAAGG GGCTAGCTACAACGA GCTGTCCA


6387


3188


CACCUUCU A CCGCUCAC


5400


GTGAGCGG GGCTAGCTACAACGA AGAAGGTG


6388


3191


CUUCUACC G CUCACUGC


5401


GCAGTGAG GGCTAGCTACAACGA GGTAGAAG


6389


3195


UACCGCUC A CUGCUGGA


5402


TCCAGCAG GGCTAGCTACAACGA GAGCGGTA


6390


3198


CGCUCACU G CUGGAGGA


5403


TCCTCCAG GGCTAGCTACAACGA AGTGAGCG


6391


3206


GCUGGAGG A CGAUGACA


5404


TGTCATCG GGCTAGCTACAACGA CCTCCAGC


6392


3209


GGAGGACG A UGACAUGG


5405


CCATGTCA GGCTAGCTACAACGA CGTCCTCC


6393


3212


GGACGAUG A CAUGGGGG


5406


CCCCCATG GGCTAGCTACAACGA CATCGTCC


6394


3214


ACGAUGAC A UGGGGGAC


5407


GTCCCCCA GGCTAGCTACAACGA GTCATCGT


6395


3221


CAUGGGGG A CCUGGUGG


5408


CCACCAGG GGCTAGCTACAACGA CCCCCATG


6396


3226


GGGACCUG G UGGAUGCU


5409


AGCATCCA GGCTAGCTACAACGA CAGGTCCC


6397


3230


CCUGGUGG A UGCUGAGG


5410


CCTCAGCA GGCTAGCTACAACGA CCACCAGG


6398


3232


UGGUGGAU G CUGAGGAG


5411


CTCCTCAG GGCTAGCTACAACGA ATCCACCA


6399


3240


GCUGAGGA G UAUCUGGU


5412


ACCAGATA GGCTAGCTACAACGA TCCTCAGC


6400


3242


UGAGGAGU A UCUGGUAC


5413


GTACCAGA GGCTAGCTACAACGA ACTCCTCA


6401


3247


AGUAUCUG G UACCCCAG


5414


CTGGGGTA GGCTAGCTACAACGA CAGATACT


6402


3249


UAUCUGGU A CCCCAGCA


5415


TGCTGGGG GGCTAGCTACAACGA ACCAGATA


6403


3255


GUACCCCA G CAGGGCUU


5416


AAGCCCTG GGCTAGCTACAACGA TGGGGTAC


6404


3260


CCAGCAGG G CUUCUUCU


5417


AGAAGAAG GGCTAGCTACAACGA CCTGCTGG


6405


3269


CUUCUUCU G UCCAGACC


5418


GGTCTGGA GGCTAGCTACAACGA AGAAGAAG


6406


3275


CUGUCCAG A CCCUGCCC


5419


GGGCAGGG GGCTAGCTACAACGA CTGGACAG


6407


3280


CAGACCCU G CCCCGGGC


5420


GCCCGGGG GGCTAGCTACAACGA AGGGTCTG


6408


3287


UGCCCCGG G CGCUGGGG


5421


CCCCAGCG GGCTAGCTACAACGA CCGGGGCA


6409 


3289 


CCCCGGGC G CUGGGGGC 


5422 


GCCCCCAG GGCTAGCTACAACGA GCCCGGGG


6410


3296 


CGCUGGGG G CAUGGUCC


5423 


GGACCATG GGCTAGCTACAACGA CCCCAGCG 


6411 


3298 


CUGGGGGC A UGGUCCAC 


5424 


GTGGACCA GGCTAGCTACAACGA GCCCCCAG 


6412 


3301 


GGGGCAUG G UCCACCAC 


5425 


GTGGTGGA GGCTAGCTACAACGA CATGCCCC 


6413 


3305 


CAUGGUCC A CCACAGGC 


5426 


GCCTGTGG GGCTAGCTACAACGA GGACCATG


6414 


3308 


GGUCCACC A CAGGCACC 


5427 


GGTGCCTG GGCTAGCTACAACGA GGTGGACC 


6415 


3312 


CACCACAG G CACCGCAG 


5428 


CTGCGGTG GGCTAGCTACAACGA CTGTGGTG 


6416 


3314 


CCACAGGC A CCGCAGCU 


5429 


AGCTGCGG GGCTAGCTACAACGA GCCTGTGG 


6417 


3317 


CAGGCACC G CAGCUCAU 


5430 


ATGAGCTG GGCTAGCTACAACGA GGTGCCTG 


6418 
3320


GCACCGCA G CUCAUCUA 


5431 


TAGATGAG GGCTAGCTACAACGA TGCGGTGC


6419 


3324 


CGCAGCUC A UCUACCAG 


5432 


CTGGTAGA GGCTAGCTACAACGA GAGCTGCG 


6420 


3328 


GCUCAUCU A CCAGGAGU 


5433 


ACTCCTGG GGCTAGCTACAACGA AGATGAGC 


6421 


3335 


UACCAGGA G UGGCGGUG


5434 


CACCGCCA GGCTAGCTACAACGA TCCTGGTA 


6422 


3338 


CAGGAGUG G CGGUGGGG 


5435 


CCCCACCG GGCTAGCTACAACGA CACTCCTG 


6423 


3341 


GAGUGGCG G UGGGGACC 


5436 


GGTCCCCA GGCTAGCTACAACGA CGCCACTC 


6424 


3347 


CGGUGGGG A CCUGACAC 


5437 


GTGTCAGG GGCTAGCTACAACGA CCCCACCG 


6425 


3352 


GGGACCUG A CACUAGGG 


5438 


CCCTAGTG GGCTAGCTACAACGA CAGGTCCC 


6426 


3354 


GACCUGAC A CUAGGGCU 


5439 


AGCCCTAG GGCTAGCTACAACGA GTCAGGTC 


6427 


3360 


ACACUAGG G CUGGAGCC 


5440 


GGCTCCAG GGCTAGCTACAACGA CCTAGTGT 


6428 


3366


GGGCUGGA G CCCUCUGA 


5441 


TCAGAGGG GGCTAGCTACAACGA TCCAGCCC 


6429 


3382


AAGAGGAG G CCCCCAGG 


5442 


CCTGGGGG GGCTAGCTACAACGA CTCCTCTT 


6430 


3390


GCCCCCAG G UCUCCACU 


5443 


AGTGGAGA GGCTAGCTACAACGA CTGGGGGC 


6431 


3396 


AGGUCUCC A CUGGCACC 


5444 


GGTGCCAG GGCTAGCTACAACGA GGAGACCT 


6432 


3400 


CUCCACUG G CACCCUCC 


5445 


GGAGGGTG GGCTAGCTACAACGA CAGTGGAG 


6433 


3402 


CCACUGGC A CCCUCCGA


5446 


TCGGAGGG GGCTAGCTACAACGA GCCAGTGG 


6434 


3415 


CCGAAGGG G CUGGCUCC 


5447 


GGAGCCAG GGCTAGCTACAACGA CCCTTCGG 


6435 


3419 


AGGGGCUG G CUCCGAUG 


5448 


CATCGGAG GGCTAGCTACAACGA CAGCCCCT 


6436 


3425 


UGGCUCCG A UGUAUUUG 


5449 


CAAATACA GGCTAGCTACAACGA CGGAGCCA 


6437 


3427 


GCUCCGAU G UAUUUGAU 


5450 


ATCAAATA GGCTAGCTACAACGA ATCGGAGC 


6438 


3429 


UCCGAUGU A UUUGAUGG 


5451 


CCATCAAA GGCTAGCTACAACGA ACATCGGA 


6439 


3434 


UGUAUUUG A UGGUGACC 


5452 


GGTCACCA GGCTAGCTACAACGA CAAATACA 


6440 


3437 


AUUUGAUG G UGACCUGG 


5453 


CCAGGTCA GGCTAGCTACAACGA CATCAAAT 


6441 


3440 


UGAUGGUG A CCUGGGAA 


5454 


TTCCCAGG GGCTAGCTACAACGA CACCATCA 


6442 


3448 


ACCUGGGA A UGGGGGCA 


5455 


TGCCCCCA GGCTAGCTACAACGA TCCCAGGT 


6443 


3454 


GAAUGGGG G CAGCCAAG 


5456 


CTTGGCTG GGCTAGCTACAACGA CCCCATTC 


6444 


3457 


UGGGGGCA G CCAAGGGG 


5457 


CCCCTTGG GGCTAGCTACAACGA TGCCCCCA 


6445 


3465 


GCCAAGGG G CUGCAAAG 


5458 


CTTTGCAG GGCTAGCTACAACGA CCCTTGGC 


6446 


3468 


AAGGGGCU G CAAAGCCU 


5459 


AGGCTTTG GGCTAGCTACAACGA AGCCCCTT 


6447 


3473 


GCUGCAAA G CCUCCCCA 


5460 


TGGGGAGG GGCTAGCTACAACGA TTTGCAGC 


6448 


3481 


GCCUCCCC A CACAUGAC 


5461 


GTCATGTG GGCTAGCTACAACGA GGGGAGGC 


6449 


3483 


CUCCCCAC A CAUGACCC 


5462 


GGGTCATG GGCTAGCTACAACGA GTGGGGAG 


6450 


3485 


CCCCACAC A UGACCCCA 


5463 


TGGGGTCA GGCTAGCTACAACGA GTGTGGGG 


6451 


3488 


CACACAUG A CCCCAGCC 


5464 


GGCTGGGG GGCTAGCTACAACGA CATGTGTG 


6452 


3494 


UGACCCCA G CCCUCUAC 


5465 


GTAGAGGG GGCTAGCTACAACGA TGGGGTCA 


6453 


3501 


AGCCCUCU A CAGCGGUA 


5466 


TACCGCTG GGCTAGCTACAACGA AGAGGGCT 


6454 


3504 


CCUCUACA G CGGUACAG 


5467 


CTGTACCG GGCTAGCTACAACGA TGTAGAGG 


6455 


3507 


CUACAGCG G UACAGUGA 


5468 


TCACTGTA GGCTAGCTACAACGA CGCTGTAG 


6456 


3509 


ACAGCGGU A CAGUGAGG 


5469 


CCTCACTG GGCTAGCTACAACGA ACCGCTGT 


6457 


3512 


GCGGUACA G UGAGGACC 


5470 


GGTCCTCA GGCTAGCTACAACGA TGTACCGC 


6458 


3518 


CAGUGAGG A CCCCACAG 


5471 


CTGTGGGG GGCTAGCTACAACGA CCTCACTG 


6459 


3523 


AGGACCCC A CAGUACCC 


5472 


GGGTACTG GGCTAGCTACAACGA GGGGTCCT 


6460 


3526 


ACCCCACA G UACCCCUG 


5473 


CAGGGGTA GGCTAGCTACAACGA TGTGGGGT 


6461


3528


CCCACAGU A CCCCUGCC


5474 


GGCAGGGG GGCTAGCTACAACGA ACTGTGGG 


6462 


3534 


GUACCCCU G CCCUCUGA 


5475 


TCAGAGGG GGCTAGCTACAACGA AGGGGTAC 


6463 


3544 


CCUCUGAG A CUGAUGGC 


5476 


GCCATCAG GGCTAGCTACAACGA CTCAGAGG 


6464 


3548 


UGAGACUG A UGGCUACG 


5477 


CGTAGCCA GGCTAGCTACAACGA CAGTCTCA 


6465 


3551 


GACUGAUG G CUACGUUG 


5478 


CAACGTAG GGCTAGCTACAACGA CATCAGTC 


6466 


3554 


UGAUGGCU A CGUUGCCC 


5479 


GGGCAACG GGCTAGCTACAACGA AGCCATCA 


6467 


3556 


AUGGCUAC G UUGCCCCC 


5480 


GGGGGCAA GGCTAGCTACAACGA GTAGCCAT 


6468 


3559 


GCUACGUU G CCCCCCUG 


5481 


CAGGGGGG GGCTAGCTACAACGA AACGTAGC 


6469 


3568 


CCCCCCUG A CCUGCAGC 


5482 


GCTGCAGG GGCTAGCTACAACGA CAGGGGGG 


6470 
3572


CCUGACCU 


G 


CAGCCCCC 


5483 


GGGGGCTG 


GGCTAGCTACAACGA 


AGGTCAGG 


6471 


3575 


GACCUGCA 


G 


CCCCCAGC 


5484 


GCTGGGGG 


GGCTAGCTACAACGA 


TGCAGGTC 


6472 


3582 


AGCCCCCA 


G 


CCUGAAUA 


5485 


TATTCAGG 


GGCTAGCTACAACGA 


TGGGGGCT 


6473 


3588 


CAGCCUGA 


A 


UAUGUGAA 


5486 


TTCACATA 


GGCTAGCTACAACGA 


TCAGGCTG 


6474 


3590 


GCCUGAAU 


A 


UGUGAACC 


5487 


GGTTCACA 


GGCTAGCTACAACGA 


ATTCAGGC 


6475 


3592 


CUGAAUAU 


G 


UGAACCAG 


5488 


CTGGTTCA 


GGCTAGCTACAACGA 


ATATTCAG 


6476 


3596 


AUAUGUGA 


A 


CCAGCCAG 


5489 


CTGGCTGG 


GGCTAGCTACAACGA 


TCACATAT 


6477 


3600 


GUGAACCA 


G 


CCAGAUGU 


5490 


ACATCTGG 


GGCTAGCTACAACGA 


TGGTTCAC 


6478 


3605 


CCAGCCAG 


A 


UGUUCGGC 


5491 


GCCGAACA 


GGCTAGCTACAACGA 


CTGGCTGG 


6479 


3607 


AGCCAGAU 


G 


UUCGGCCC 


5492 


GGGCCGAA 


GGCTAGCTACAACGA 


ATCTGGCT 


6480 


3612 


GAUGUUCG 


G 


CCCCAGCC 


5493 


GGCTGGGG 


GGCTAGCTACAACGA 


CGAACATC 


6481 


3618 


CGGCCCCA 


G 


CCCCCUUC 


5494 


GAAGGGGG 


GGCTAGCTACAACGA 


TGGGGCCG 


6482 


3627 


CCCCCUUC 


G 


CCCCGAGA 


5495 


TCTCGGGG 


GGCTAGCTACAACGA 


GAAGGGGG 


6483 


3638 


CCGAGAGG 


G 


CCCUCUGC 


5496 


GCAGAGGG 


GGCTAGCTACAACGA 


CCTCTCGG 


6484 


3645 


GGCCCUCU 


G 


CCUGCUGC 


5497 


GCAGCAGG 


GGCTAGCTACAACGA 


AGAGGGCC 


6485 


3649 


CUCUGCCU 


G 


CUGCCCGA 


5498 


TCGGGCAG 


GGCTAGCTACAACGA 


AGGCAGAG 


6486 


3652 


UGCCUGCU 


G 


CCCGACCU 


5499 


AGGTCGGG 


GGCTAGCTACAACGA 


AGCAGGCA 


6487 


3657 


GCUGCCCG 


A 


CCUGCUGG 


5500 


CCAGCAGG 


GGCTAGCTACAACGA 


CGGGCAGC 


6488 


3661 


CCCGACCU 


G 


CUGGUGCC 


5501 


GGCACCAG 


GGCTAGCTACAACGA 


AGGTCGGG 


6489 


3665 


ACCUGCUG 


G 


UGCCACUC 


5502 


GAGTGGCA 


GGCTAGCTACAACGA 


CAGCAGGT 


6490 


3667 


CUGCUGGU 


G 


CCACUCUG 


5503 


CAGAGTGG 


GGCTAGCTACAACGA 


ACCAGCAG 


6491 


3670 


CUGGUGCC 


A 


CUCUGGAA 


5504 


TTCCAGAG 


GGCTAGCTACAACGA 


GGCACCAG 


6492 


3681 


CUGGAAAG 


G 


CCCAAGAC 


5505 


GTCTTGGG 


GGCTAGCTACAACGA 


CTTTCCAG 


6493 


3688 


GGCCCAAG 


A 


CUCUCUCC 


5506 


GGAGAGAG 


GGCTAGCTACAACGA 


CTTGGGCC 


6494 


3707 


AGGGAAGA 


A 


UGGGGUCG 


5507 


CGACCCCA 


GGCTAGCTACAACGA 


TCTTCCCT 


6495 


3712 


AGAAUGGG 


G 


UCGUCAAA 


5508 


TTTGACGA 


GGCTAGCTACAACGA 


CCCATTCT 


6496 


3715 


AUGGGGUC 


G 


UCAAAGAC 


5509 


GTCTTTGA 


GGCTAGCTACAACGA 


GACCCCAT 


6497 


3722 


CGUCAAAG 


A 


CGUUUUUG


5510 


CAAAAACG 


GGCTAGCTACAACGA 


CTTTGACG 


6498 


3724 


UCAAAGAC 


G 


UUUUUGCC 


5511 


GGCAAAAA 


GGCTAGCTACAACGA 


GTCTTTGA 


6499 


3730 


ACGUUUUU 


G 


CCUUUGGG 


5512 


CCCAAAGG 


GGCTAGCTACAACGA 


AAAAACGT 


6500 


3740 


CUUUGGGG 


G 


UGCCGUGG 


5513 


CCACGGCA


GGCTAGCTACAACGA 


CCCCAAAG 


6501 


3742 


UUGGGGGU 


G 


CCGUGGAG 


5514 


CTCCACGG 


GGCTAGCTACAACGA 


ACCCCCAA 


6502 


3745 


GGGGUGCC 


G 


UGGAGAAC 


5515 


GTTCTCCA 


GGCTAGCTACAACGA 


GGCACCCC 


6503 


3752 


CGUGGAGA 


A 


CCCCGAGU 


5516 


ACTCGGGG 


GGCTAGCTACAACGA 


TCTCCACG 


6504 


3759


AACCCCGA 


G 


UACUUGAC 


5517 


GTCAAGTA 


GGCTAGCTACAACGA 


TCGGGGTT 


6505 


3761 


CCCCGAGU 


A 


CUUGACAC 


5518 


GTGTCAAG 


GGCTAGCTACAACGA 


ACTCGGGG 


6506 


3766 


AGUACUUG 


A 


CACCCCAG 


5519 


CTGGGGTG 


GGCTAGCTACAACGA 


CAAGTACT 


6507 


3768 


UACUUGAC 


A 


CCCCAGGG 


5520 


CCCTGGGG 


GGCTAGCTACAACGA 


GTCAAGTA 


6508 


3781 


AGGGAGGA 


G 


CUGCCCCU 


5521 


AGGGGCAG 


GGCTAGCTACAACGA 


TCCTCCCT 


6509 


3784 


GAGGAGCU 


G 


CCCCUCAG 


5522 


CTGAGGGG 


GGCTAGCTACAACGA 


AGCTCCTC 


6510 


3792 


GCCCCUCA 


G 


CCCCACCC 


5523 


GGGTGGGG 


GGCTAGCTACAACGA 


TGAGGGGC 


6511 


3797 


UCAGCCCC 


A


CCCUCCUC 


5524 


GAGGAGGG 


GGCTAGCTACAACGA 


GGGGCTGA 


6512 


3808 


CUCCUCCU 


G 


CCUUCAGC 


5525 


GCTGAAGG 


GGCTAGCTACAACGA 


AGGAGGAG 


6513 


3815 


UGCCUUCA 


G 


CCCAGCCU 


5526 


AGGCTGGG 


GGCTAGCTACAACGA 


TGAAGGCA 


6514 


3820 


UCAGCCCA 


G 


CCUUCGAC 


5527 


GTCGAAGG 


GGCTAGCTACAACGA 


TGGGCTGA 


6515 


3827 


AGCCUUCG 


A 


CAACCUCU 


5528 


AGAGGTTG 


GGCTAGCTACAACGA 


CGAAGGCT 


6516 


3830 


CUUCGACA 


A 


CCUCUAUU 


5529 


AATAGAGG 


GGCTAGCTACAACGA 


TGTCGAAG 


6517 


3836 


CAACCUCU 


A 


UUACUGGG 


5530 


CCCAGTAA 


GGCTAGCTACAACGA 


AGAGGTTG 


6518 


3839 


CCUCUAUU 


A 


CUGGGACC 


5531 


GGTCCCAG 


GGCTAGCTACAACGA 


AATAGAGG 


6519 


3845 


UUACUGGG 


A 


CCAGGACC 


5532 


GGTCCTGG 


GGCTAGCTACAACGA 


CCCAGTAA 


6520 


3851 


GGACCAGG 


A 


CCCACCAG 


5533 


CTGGTGGG 


GGCTAGCTACAACGA 


CCTGGTCC 


6521 


3855 


CAGGACCC 


A 


CCAGAGCG 


5534 


CGCTCTGG 


GGCTAGCTACAACGA 


GGGTCCTG 


6522 
3861


CCACCAGA G CGGGGGGC 


5535 


GCCCCCCG GGCTAGCTACAACGA TCTGGTGG 


6523 


3868 


AGCGGGGG G CUCCACCC 


5536 


GGGTGGAG GGCTAGCTACAACGA CCCCCGCT 


6524 


3873 


GGGGCUCC A CCCAGCAC 


5537 


GTGCTGGG GGCTAGCTACAACGA GGAGCCCC 


6525 


3878 


UCCACCCA G CACCUUCA 


5538 


TGAAGGTG GGCTAGCTACAACGA TGGGTGGA 


6526 


3880 


CACCCAGC A CCUUCAAA 


5539 


TTTGAAGG GGCTAGCTACAACGA GCTGGGTG 


6527 


3892 


UCAAAGGG A CACCUACG 


5540 


CGTAGGTG GGCTAGCTACAACGA CCCTTTGA 


6528 


3894 


AAAGGGAC A CCUACGGC 


5541 


GCCGTAGG GGCTAGCTACAACGA GTCCCTTT 


6529 


3898 


GGACACCU A CGGCAGAG 


5542 


CTCTGCCG GGCTAGCTACAACGA AGGTGTCC 


6530 


3901 


CACCUACG G CAGAGAAC 


5543 


GTTCTCTG GGCTAGCTACAACGA CGTAGGTG 


6531 


3908 


GGCAGAGA A CCCAGAGU 


5544 


ACTCTGGG GGCTAGCTACAACGA TCTCTGCC 


6532 


3915 


AACCCAGA G UACCUGGG 


5545 


CCCAGGTA GGCTAGCTACAACGA TCTGGGTT 


6533 


3917 


CCCAGAGU A CCUGGGUC 


5546 


GACCCAGG GGCTAGCTACAACGA ACTCTGGG 


6534 


3923 


GUACCUGG G UCUGGACG 


5547 


CGTCCAGA GGCTAGCTACAACGA CCAGGTAC 


6535 


3929 


GGGUCUGG A CGUGCCAG 


5548 


CTGGCACG GGCTAGCTACAACGA CCAGACCC 


6536 


3931 


GUCUGGAC G UGCCAGUG 


5549 


CACTGGCA GGCTAGCTACAACGA GTCCAGAC 


6537 


3933 


CUGGACGU G CCAGUGUG 


5550 


CACACTGG GGCTAGCTACAACGA ACGTCCAG 


6538 


3937 


ACGUGCCA G UGUGAACC 


5551 


GGTTCACA GGCTAGCTACAACGA TGGCACGT 


6539 


3939 


GUGCCAGU G UGAACCAG 


5552 


CTGGTTCA GGCTAGCTACAACGA ACTGGCAC 


6540 


3943 


CAGUGUGA A CCAGAAGG 


5553 


CCTTCTGG GGCTAGCTACAACGA TCACACTG 


6541 


3951 


ACCAGAAG G CCAAGUCC 


5554 


GGACTTGG GGCTAGCTACAACGA CTTCTGGT 


6542 


3956 


AAGGCCAA G UCCGCAGA 


5555 


TCTGCGGA GGCTAGCTACAACGA TTGGCCTT 


6543 


3960 


CCAAGUCC G CAGAAGCC 


5556 


GGCTTCTG GGCTAGCTACAACGA GGACTTGG 


6544 


3966 


CCGCAGAA G CCCUGAUG 


5557 


CATCAGGG GGCTAGCTACAACGA TTCTGCGG 


6545 


3972 


AAGCCCUG A UGUGUCCU 


5558 


AGGACACA GGCTAGCTACAACGA CAGGGCTT 


6546 


3974 


GCCCUGAU G UGUCCUCA 


5559 


TGAGGACA GGCTAGCTACAACGA ATCAGGGC 


6547 


3976 


CCUGAUGU G UCCUCAGG 


5560 


CCTGAGGA GGCTAGCTACAACGA ACATCAGG 


6548 


3987 


CUCAGGGA G CAGGGAAG 


5561 


CTTCCCTG GGCTAGCTACAACGA TCCCTGAG 


6549 


3996 


CAGGGAAG G CCUGACUU 


5562 


AAGTCAGG GGCTAGCTACAACGA CTTCCCTG 


6550 


4001 


AAGGCCUG A CUUCUGCU 


5563 


AGCAGAAG GGCTAGCTACAACGA CAGGCCTT 


6551 


4007 


UGACUUCU G CUGGCAUC 


5564 


GATGCCAG GGCTAGCTACAACGA AGAAGTCA 


6552 


4011 


UUCUGCUG G CAUCAAGA 


5565 


TCTTGATG GGCTAGCTACAACGA CAGCAGAA 


6553 


4013 


CUGCUGGC A UCAAGAGG 


5566 


CCTCTTGA GGCTAGCTACAACGA GCCAGCAG 


6554 


4021 


AUCAAGAG G UGGGAGGG 


5567 


CCCTCCCA GGCTAGCTACAACGA CTCTTGAT 


6555 


4029 


GUGGGAGG G CCCUCCGA 


5568 


TCGGAGGG GGCTAGCTACAACGA CCTCCCAC 


6556 


4037 


GCCCUCCG A CCACUUCC 


5569 


GGAAGTGG GGCTAGCTACAACGA CGGAGGGC 


6557 


4040 


CUCCGACC A CUUCCAGG 


5570 


CCTGGAAG GGCTAGCTACAACGA GGTCGGAG 


6558 


4052 


CCAGGGGA A CCUGCCAU 


5571 


ATGGCAGG GGCTAGCTACAACGA TCCCCTGG 


6559 


4056 


GGGAACCU G CCAUGCCA 


5572 


TGGCATGG GGCTAGCTACAACGA AGGTTCCC 


6560 


4059 


AACCUGCC A UGCCAGGA 


5573 


TCCTGGCA GGCTAGCTACAACGA GGCAGGTT 


6561 


4061 


CCUGCCAU G CCAGGAAC 


5574 


GTTCCTGG GGCTAGCTACAACGA ATGGCAGG 


6562 


4068 


UGCCAGGA A CCUGUCCU 


5575 


AGGACAGG GGCTAGCTACAACGA TCCTGGCA 


6563 


4072 


AGGAACCU G UCCUAAGG 


5576 


CCTTAGGA GGCTAGCTACAACGA AGGTTCCT 


6564 


4082 


CCUAAGGA A CCUUCCUU 


5577 


AAGGAAGG GGCTAGCTACAACGA TCCTTAGG 


6565 


4094 


UCCUUCCU G CUUGAGUU 


5578 


AACTCAAG GGCTAGCTACAACGA AGGAAGGA 


6566 


4100 


CUGCUUGA G UUCCCAGA 


5579 


TCTGGGAA GGCTAGCTACAACGA TCAAGCAG 


6567 


4108 


GUUCCCAG A UGGCUGGA 


5580 


TCCAGCCA GGCTAGCTACAACGA CTGGGAAC 


6568 


4111 


CCCAGAUG G CUGGAAGG 


5581 


CCTTCCAG GGCTAGCTACAACGA CATCTGGG 


6569 


4121 


UGGAAGGG G UCCAGCCU 


5582 


AGGCTGGA GGCTAGCTACAACGA CCCTTCCA 


6570 


4126 


GGGGUCCA G CCUCGUUG 


5583 


CAACGAGG GGCTAGCTACAACGA TGGACCCC 


6571 


4131 


CCAGCCUC G UUGGAAGA 


5584 


TCTTCCAA GGCTAGCTACAACGA GAGGCTGG 


6572 


4143 


GAAGAGGA A CAGCACUG 


5585 


CAGTGCTG GGCTAGCTACAACGA TCCTCTTC 


6573 


4146 


GAGGAACA G CACUGGGG 


5586 


CCCCAGTG GGCTAGCTACAACGA TGTTCCTC 


6574 
4148


GGAACAGC A CUGGGGAG


5587


CTCCCCAG GGCTAGCTACAACGA GCTGTTCC


6575


4156


ACUGGGGA G UCUUUGUG


5588


CACAAAGA GGCTAGCTACAACGA TCCCCAGT


6576


4162


GAGUCUUU G UGGAUUCU


5589


AGAATCCA GGCTAGCTACAACGA AAAGACTC


6577


4166


CUUUGUGG A UUCUGAGG


5590


CCTCAGAA GGCTAGCTACAACGA CCACAAAG


6578


4174


AUUCUGAG G CCCUGCCC


5591


GGGCAGGG GGCTAGCTACAACGA CTCAGAAT


6579


4179


GAGGCCCU G CCCAAUGA


5592


TCATTGGG GGCTAGCTACAACGA AGGGCCTC


6580


4184


CCUGCCCA A UGAGACUC


5593


GAGTCTCA GGCTAGCTACAACGA TGGGCAGG


6581


4189


CCAAUGAG A CUCUAGGG


5594


CCCTAGAG GGCTAGCTACAACGA CTCATTGG


6582


4197


ACUCUAGG G UCCAGUGG


5595


CCACTGGA GGCTAGCTACAACGA CCTAGAGT


6583


4202


AGGGUCCA G UGGAUGCC


5596


GGCATCCA GGCTAGCTACAACGA TGGACCCT


6584


4206


UCCAGUGG A UGCCACAG


5597


CTGTGGCA GGCTAGCTACAACGA CCACTGGA


6585


4208


CAGUGGAU G CCACAGCC


5598


GGCTGTGG GGCTAGCTACAACGA ATCCACTG


6586


4211


UGGAUGCC A CAGCCCAG


5599


CTGGGCTG GGCTAGCTACAACGA GGCATCCA


6587


4214


AUGCCACA G CCCAGCUU


5600


AAGCTGGG GGCTAGCTACAACGA TGTGGCAT


6588


4219


ACAGCCCA G CUUGGCCC


5601


GGGCCAAG GGCTAGCTACAACGA TGGGCTGT


6589


4224


CCAGCUUG G CCCUUUCC


5602


GGAAAGGG GGCTAGCTACAACGA CAAGCTGG


6590


4239


CCUUCCAG A UCCUGGGU


5603


ACCCAGGA GGCTAGCTACAACGA CTGGAAGG


6591


4246


GAUCCUGG G UACUGAAA


5604


TTTCAGTA GGCTAGCTACAACGA CCAGGATC


6592


4248


UCCUGGGU A CUGAAAGC


5605


GCTTTCAG GGCTAGCTACAACGA ACCCAGGA


6593 




T T A P^T TP AAA P* pn TT T A CHir* 
UAbUbAAA b UbUUAbbb 


5606 


^P'P^TA A P'P' PPPTJiPPTHPHTV/V^ riuTwrt/~i7V <TT>7\ 

bbbxAAbb bob X AbbTAbAAbbA TTTCAGTA 


6594 


49££ 


T tt ta ppp a a f r*TTPPPP r rTP« 

U U.MO OO/^rV \j V_UOObb Ub 


5607 


P*A i^P'P'P* A P* PPPTRPPTTV r»T\ /\nP1\ T"TV»/"»/-Wp7V 7V 

CAbbbCAG bbb X AbbTAbAAbbA TTbCbXAA 


6595 


499*1 


PP A A PPT TP O PPTTPQPIiP 
oO-f-VrlOL- UO Vj CLUbAVjAb 


5608 


PTPTP71PP PPPTA PPT71 P A ft PP 1\ /~17\ l»T»/^*«/*1 

b X b X bAbb bbbX AbUTAblAACbA CAGCTTCC 


6596 


49 Q4 


o aoooo a a p 1 pppppptta 

bAbbbbAA b bbbbbbUA 


5609 


T7VPPPPPP PPPT7VPPTI7V OH 7\/VT\ «T**T»/^/^i^/~"T>0 

TAbbbbbb bbb X AbbJX ACJAACbA TxCCCCTb 


6597 


49 R7 


PPPA SPPP O PPPTTfl A OP 1 
bbbAAbbb b LLLUAAbb 


5610 


bbll Abbb bbb X Ab bX AbAAbbA CG CTTCCb 


6598 


49 QR 


pt T a a cine a. p* ttpttp*ttaap 

bUAAbbbA b UbULUAAb 


5611 


bXlAbAbA bbb X AbbTAbAAbbA TbbCTTAG 


6599 


4inn 


A a nnn a pit t o ttphta a oa a 
AAbbuAbU b UL.UAAbAA 


5612 


TTCTTAGA GGCTAGCTACAACGA ACTCCCTT 


6600 


4 in a 


p»T TPT T A A Pi A A PaSRSP/V 
uUt UnAuA A bAAAAbbb 


5613 


bbb X X X lb bbb X Ab bTACAAbbA TbTTAGAC 


6601 


411 4 


paapaaaa p rv^apppan 

o,rti*OttrtA/\. O OOtMbbbAU 


5614 


A T T i f~U~±f~ tr X y (~ % C~ % /7!P , P u T , AP , P ,r P A OA A A « I u 1 u iir 1 v-irprrtO 

A 1 bbb X bb bbb X AbbTAbAAbbA TTTTG X X b 


6602 


4117 
** j J. / 


paaaappp a pppattttpa 


5615 


TPTARTPiPP PPPTAPPTA OA AOO A nrtn'i'i^rn'P 

XbAAXbbb bb b X AbbTAbAAbbA CbCX 1 X Xb 


6603 


4321 


^(^^^17^(^0^' A TTTTPAPAPA 
rtO ^-Ox\0 O O i-i UULAbAbA 


5616 


rp/*Wfi/*lrpy-i t» t\ OOO»T»AP , 0 r T , A P*A A P»0 A PHHTIPP /"t»p 

X b 1 L. XbAA bbb X Abb X AbAAbbA bbbTbbb X 


6604 


4329 


attttpapap a pt tpttp ppt t 


5617 


RPP/"J\P5\P PPPTTIPPT A P*A A P»0 A PTPTP A A *"P 

AbbbAbAb bbb X Abb 1 AbAAbbA bib X oAA X 


6605 


4332 


PAGAPAPTT P TTPPPTTPAA 


5618 


TT'P 1 APJPIP"' A PP_PTaPPT7VPAaPPa APTP^PTP 
XXbAbbbA bbb XAbbX AbAAbbA AbXL-XbXb 


6606 


4341 


TTPPPTTPAA A PPTTAPTTAP 


5619 


/^np A PT APP PPPTfiPPTA P 1 A A P'P 1 A TTP 1 A P1PJP" 1 A 
b X Ab, X Abb bbb X Abb X AbAAbbA X X bAbbbA 


6607 


4346 


PAAAPPITA CI TTA PTTP.PPP 


5620 


P-PPPBPTB PPPTRPPTA OA SPP A TAPPTTTP 

bbbbAbXA bbb XAbbX AbAAbbA XAbbX X lb 


6608 


4348 


AAPPTTAfJTTV : A PITnPPPPP 


5621 


pppppplip PPPTRPPTA OA AOOA RPTAPPTT 

bbbbbbAb bbb X Abb X AbAAbbA . Ab X Abb X X 


6609 


4351 


PTTAflTTAPTT CI C*C*(~ , OCT , Wf 


5622 


ATP/T'PPP PPPTaPPTRPATiPPR A P"T> A PT 1 A O 

A Xbrbbbbb bbb X Abb X AbAAbbA Ab X Ab X Ab 


6610 


4358 


TTP.PPPPPP A TTPiAPlPlA APi 


5623 


/ «|N|ipnmpii PPPTAPnTA OA A OOA OOO^OOO A 

b X X Lb X bA bbb X Abb 1 AbAAbbA bbbbbbbA 


6611 


4369 


AnPIA AP1P2A A PAfZPA ATTP2 


5624 


PTITTPPTP PPPTAPJPTTV OA A OO A TPPTTPPT 

bA X lob Xo bbb X AbbTAbAAbbA Xbbllbbl 


6612 


4179 


AAP3PJAAPA PI PAATTPJ~ r ITP 
AnUUtnnUA k7 UrlrtUbbUb 


5625 


OA PPAfPTT' PPPTAP/ <M'A OA A OO A TPTTPPTT 

bAbbAl lb bbb 1 AbbTAbAAbbA xoTlbbTT 


6613 


4375 


nAAPA(^PA A TTfZf^TTPTTPA 


5626 


T'OAOAOOA oOO*"PA OOT»A OA A OO A TV"»Orr l O»T»T»0 

IbAbAbbA bbb x AbbTAbAAbbA lbViblTb 


6614 


417fi 


P APPA AT TP 1 P 1 TTP1TTP , AP»TT7\ 


5627 


TAb loAbA ooCT AG bxACAAboA CATXoCTG 


6615 


4380 


PPA ATTPPTT P T TP APT TAT TP 


5628 


OA 'PA PTV 1 A OOOT'AOOrn TV OA A OO A A OOAI^TAOO 

bAlACTGA bb b X Ab bx AbAAboA ACCATTGC 


6616 


4384 


TTPPJTPTTPA P TTATTPPAPP 


5629 


O/ " 1 *00 A T>A pp pfp TV ppm 7V 07V 7V OO 7V rpOAOAOOA 

bb lob AT A GbbTAbbTAbAACGA TGACAbbA 


6617 


4386 


GUGUCAGU A UCCAGGCU 


5630 


AGCCTGGA GGCTAGCTACAACGA ACTGACAC 


6618 


4392 


GUAUCCAG G CUUUGUAC 


5631 


GTACAAAG GGCTAGCTACAACGA CTGGATAC 


6619 


4397 


CAGGCUUU G UACAGAGU 


5632 


ACTCTGTA GGCTAGCTACAACGA AAAGCCTG 


6620 


4399 


GGCUUUGU A CAGAGUGC 


5633 


GCACTCTG GGCTAGCTACAACGA ACAAAGCC 


6621 


4404 


UGUACAGA G UGCUUUUC 


5634 


GAAAAGCA GGCTAGCTACAACGA TCTGTACA 


6622 


4406 


UACAGAGU G CUUUUCUG 


5635 


CAGAAAAG GGCTAGCTACAACGA ACTCTGTA 


6623 


4414 


GCUUUUCU G UUUAGUUU 


5636 


AAACTAAA GGCTAGCTACAACGA AGAAAAGC 


6624 


4419 


UCUGUUUA G UUUUUACU 


5637 


AGTAAAAA GGCTAGCTACAACGA TAAACAGA 


6625 


4425 


UAGUUUUU A CUUUUUUU 


5638 


AAAAAAAG GGCTAGCTACAACGA AAAAACTA 


6626 
4434 


CUUUUUUU G UUUUGUUU 


5639 


AAACAAAA GGCTAGCTACAACGA AAAAAAAG 


6627 


4439 


UUUGUUUU G UUUUUUUA 


5640 


TAAAAAAA GGCTAGCTACAACGA AAAACAAA 


6628 


4451 


UUUUAAAG A UGAAAUAA 


5641 


TTATTTCA GGCTAGCTACAACGA CTTTAAAA 


6629 


4456 


AAGAUGAA A UAAAGACC 


5642 


GGTCTTTA GGCTAGCTACAACGA TTCATCTT 


6630 


4462 


AAAUAAAG A CCCAGGGG 


5643 


CCCCTGGG GGCTAGCTACAACGA CTTTATTT 


6631 



Input Sequence = HSERB2R. Cut Site = R/Y 

Arm Length = 8. Core Sequence = GGCTAGCTACAACGA 

HSERB2R (Human c-erb-B-2 mRNA; 4473 bp) 
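
The rows of the table above follow a regular pattern: the DNAzyme's 5' arm is the DNA reverse complement of the eight target bases downstream of the unpaired purine, and its 3' arm is the DNA reverse complement of the eight bases upstream, with the core sequence between them. The following Python sketch reproduces that layout for one row. The function and variable names are illustrative assumptions, not part of the application, and the sketch only checks the R/Y site rule; it says nothing about accessibility or activity of a site.

```python
# Sketch only: reproduces the arm/core layout seen in the table rows.
CORE_10_23 = "GGCTAGCTACAACGA"  # core sequence quoted in the table footer

# RNA base -> complementary DNA base
RNA_TO_DNA_COMPLEMENT = {"A": "T", "C": "G", "G": "C", "U": "A"}


def dna_reverse_complement(rna: str) -> str:
    """Return the DNA strand pairing with an RNA segment, written 5'->3'."""
    return "".join(RNA_TO_DNA_COMPLEMENT[base] for base in reversed(rna))


def design_dnazyme(target: str, core: str = CORE_10_23) -> str:
    """Build a DNAzyme row from a target written as 'upstream R downstream'."""
    upstream, unpaired, downstream = target.split()
    # R/Y site: unpaired purine followed by a pyrimidine.
    if unpaired not in {"A", "G"} or downstream[0] not in {"C", "U"}:
        raise ValueError("target is not an R/Y site")
    five_prime_arm = dna_reverse_complement(downstream)
    three_prime_arm = dna_reverse_complement(upstream)
    return f"{five_prime_arm} {core} {three_prime_arm}"


# Target at position 4126 of HSERB2R, taken from the table.
print(design_dnazyme("GGGGUCCA G CCUCGUUG"))
```

For the target at position 4126 this yields the DNAzyme listed in the same row of the table.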



Table V: Human HER2 Synthetic DNAzyme and Target molecules 



Gene 


Pos 


Target 


Seq 
ID 


RPI# 


DNAzyme 


Seq ID 


erbB2 


377 


CCACCA A UGCCAG 


6632 


24998 


cuggca GGCTAGCTACAACGA uggugg B 


6637 


erbB2 


766 


UUCUCCG A UGUGUAA 


6633 


24999 


uuacaca GGCTAGCTACAACGA cggagaa B 


6638 


erbB2 


1202 


UGUGCU A UGGUCU 


6634 


25000 


agacca GGCTAGCTACAACGA agcaca B 


6639 


erbB2 


1444 


CCUCAGC G UCUUCCA 


6635 


25001 


uggaaga GGCTAGCTACAACGA gcugagg B 


6640 


erbB2 


1583 


AUCCACC A UAACACC 


6636 


25002 


gguguua GGCTAGCTACAACGA gguggau B 


6641 



A, G, C, T (italic) = deoxy 
lower case = 2'-O-methyl 
B = inverted deoxyabasic derivative 
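
As a reading aid for the notation above, the sketch below rebuilds a Table V row from its target: the binding arms are written in lower case (2'-O-methyl) as the RNA reverse complement of the target flanks, the core stays in upper case (deoxy), and a trailing B marks the inverted deoxyabasic cap. The helper names are hypothetical, and the code only formats the string; it does not model the chemistry.

```python
# Sketch only: string layout of a stabilized DNAzyme row as written in Table V.
CORE_10_23 = "GGCTAGCTACAACGA"  # deoxy core, upper case in the table

RNA_COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}


def rna_reverse_complement(rna: str) -> str:
    """Return the RNA reverse complement of an RNA segment, 5'->3'."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(rna))


def stabilized_dnazyme(target: str) -> str:
    """Format 'upstream R downstream' as arms (lower case), core, arm, cap."""
    upstream, _unpaired, downstream = target.split()
    five_prime_arm = rna_reverse_complement(downstream).lower()  # 2'-O-methyl
    three_prime_arm = rna_reverse_complement(upstream).lower()   # 2'-O-methyl
    return f"{five_prime_arm} {CORE_10_23} {three_prime_arm} B"


# erbB2 position 377, taken from Table V.
print(stabilized_dnazyme("CCACCA A UGCCAG"))
```

The arm length is simply whatever the target row supplies, which matches the mix of six- and seven-base arms in the table.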



Table VI: Human HIV Hammerhead Ribozyme and Substrate Sequence 



Substrate 


Seq 
ID 


Hammerhead 


Seq 
ID 


AUAAAGCU U GCCUUGAG 


6642 


CUCAAGGC CUGAUGAGGCCGUUAGGCCGAA AGCUUUAU 


6727 


AGGCUAAU U UUUUAGGG 


6643 


CCCUAAAA CUGAUGAGGCCGUUAGGCCGAA AUUAGCCU 


6728 


GGCUAAUU U UUUAGGGA 


6644 


UCCCUAAA CUGAUGAGGCCGUUAGGCCGAA AAUUAGCC 


6729 


GCCUCAAU A AAGCUUGC 


6645 


GCAAGCUU CUGAUGAGGCCGUUAGGCCGAA AUUGAGGC 


6730 


UUUCGGGU U UAUUACAG 


6646 


CUGUAAUA CUGAUGAGGCCGUUAGGCCGAA ACCCGAAA 


6731 


GCAGGACU C GGCUUGCU 


6647 


AGCAAGCC CUGAUGAGGCCGUUAGGCCGAA AGUCCUGC 


6732 



Input Sequence = HIV1. Cut Site = UH/. 

Arm Length = 8. Core Sequence = CUGAUGAG GCCGUUAGGC CGAA 
HIV1 Consensus 



Underlined region can be any X sequence or linker, as described herein. 
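
The hammerhead rows above can be regenerated from their substrates in the same way: each binding arm is the RNA reverse complement of one substrate flank, placed on either side of the core quoted in the footer. The Python sketch below illustrates this under the assumption that a substrate is written as "upstream H downstream" with the upstream flank ending in U. The names are illustrative and not taken from the application.

```python
# Sketch only: arm/core layout of the hammerhead rows in Table VI.
HAMMERHEAD_CORE = "CUGAUGAGGCCGUUAGGCCGAA"  # core from the table footer, spaces removed

RNA_COMPLEMENT = {"A": "U", "C": "G", "G": "C", "U": "A"}


def rna_reverse_complement(rna: str) -> str:
    """Return the RNA reverse complement of an RNA segment, 5'->3'."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(rna))


def design_hammerhead(substrate: str, core: str = HAMMERHEAD_CORE) -> str:
    """Build a hammerhead row from a substrate 'upstream H downstream'."""
    upstream, unpaired, downstream = substrate.split()
    # UH/ site: U immediately before the unpaired base, which is A, C or U.
    if not upstream.endswith("U") or unpaired not in {"A", "C", "U"}:
        raise ValueError("substrate is not a UH site")
    return f"{rna_reverse_complement(downstream)} {core} {rna_reverse_complement(upstream)}"


# Substrate with Seq ID 6643, taken from Table VI.
print(design_hammerhead("AGGCUAAU U UUUUAGGG"))
```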
Table VII: Human HIV Inozyme and Substrate Sequence 



Substrate 


Seq 
ID 


Inozyme 


Seq 
ID 


UGGAAAAC A GAUGGCAG 


6648 


CUGCCAUC CUGAUGAGGCCGUUAGGCCGAA IUUUUCCA 


6733 


AAUAAAGC U UGCCUUGA 


6649 


UCAAGGCA CUGAUGAGGCCGUUAGGCCGAA ICUUUAUU 


6734 


UCUCUAGC A GUGGCGCC 


6650 


GGCGCCAC CUGAUGAGGCCGUUAGGCCGAA ICUAGAGA 


6735 


GGAGCCAC C CCACAAGA 


6651 


UCUUGUGG CUGAUGAGGCCGUUAGGCCGAA IUGGCUCC 


6736 


AGUGGCGC C CGAACAGG 


6652 


CCUGUUCG CUGAUGAGGCCGUUAGGCCGAA ICGCCACU 


6737 


GUGGCGCC C GAACAGGG 


6653 


CCCUGUUC CUGAUGAGGCCGUUAGGCCGAA IGCGCCAC 


6738 


CUCGACGC A GGACUCGG 


6654 


CCGAGUCC CUGAUGAGGCCGUUAGGCCGAA ICGUCGAG 


6739 


CGCAGGAC U CGGCUUGC 


6655 


GCAAGCCG CUGAUGAGGCCGUUAGGCCGAA IUCCUGCG 


6740 



Input Sequence = HIV1. Cut Site = CH/. 

Arm Length = 8 . Core Sequence = CUGAUGAG GCCGUUAGGC CGAA 
HIV1 Consensus 

Underlined region can be any X sequence or linker, as described herein. 
"I" stands for Inosine. 
Table VIII: Human HIV Zinzyme and Substrate Sequence 



Substrate 


Seq 
ID 


Zinzyme 


Seq 
ID 


UCAAUAAA G CUUGCCUU 


6656 


AAGGCAAG GCCGAAAGGCGAGUGAGGUCU UUUAUUGA 


6741 


AGGACUCG G CUUGCUGA 


6657 


UCAGCAAG GCCGAAAGGCGAGUGAGGUCU CGAGUCCU 


6742 


GCAGUGGC G CCCGAACA 


6658 


UGUUCGGG GCCGAAAGGCGAGUGAGGUCU GCCACUGC 


6743 


CUCUAGCA G UGGCGCCC 


6659 


GGGCGCCA GCCGAAAGGCGAGUGAGGUCU UGCUAGAG 


6744 


UAGCAGUG G CGCCCGAA 


6660 


UUCGGGCG GCCGAAAGGCGAGUGAGGUCU CACUGCUA 


6745 


AGAGAUGG G UGCGAGAG 


6661 


CUCUCGCA GCCGAAAGGCGAGUGAGGUCU CCAUCUCU 


6746 


AGAUGGGU G CGAGAGCG 


6662 


CGCUCUCG GCCGAAAGGCGAGUGAGGUCU ACCCAUCU 


6747 


CUCUCGAC G CAGGACUC 


6663 


GAGUCCUG GCCGAAAGGCGAGUGAGGUCU GUCGAGAG 


6748 



Input Sequence = HIV1. Cut Site = G/Y 

Arm Length = 8 . Core Sequence = GCcgaaagGCGaGuCaaGGuCu 
HIV1 Consensus 
Table IX: Human HIV DNAzyme and Substrate Sequence 



Substrate 


Seq 
ID 


DNAzyme 


Seq 
ID 


UCAAUAAA G CUUGCCUU 


6656 


AAGGCAAG GGCTAGCTACAACGA TTTATTGA 


6749 


AGGACUCG G CUUGCUGA 


6657 


TCAGCAAG GGCTAGCTACAACGA CGAGTCCT 


6750 


GCAGUGGC G CCCGAACA 


6658 


TGTTCGGG GGCTAGCTACAACGA GCCACTGC 


6751 


CUCUAGCA G UGGCGCCC 


6659 


GGGCGCCA GGCTAGCTACAACGA TGCTAGAG 


6752 


UAGCAGUG G CGCCCGAA 


6660 


TTCGGGCG GGCTAGCTACAACGA CACTGCTA 


6753 


AGAGAUGG G UGCGAGAG 


6661 


CTCTCGCA GGCTAGCTACAACGA CCATCTCT 


6754 


AGAUGGGU G CGAGAGCG 


6662 


CGCTCTCG GGCTAGCTACAACGA ACCCATCT 


6755 


CUCUCGAC G CAGGACUC 


6663 


GAGTCCTG GGCTAGCTACAACGA GTCGAGAG 


6756 


UAUGGAAA A CAGAUGGC 


6664 


GCCATCTG GGCTAGCTACAACGA TTTCCATA 


6757 


GAAAACAG A UGGCAGGU 


6665 


ACCTGCCA GGCTAGCTACAACGA CTGTTTTC 


6758 


AAGCCUCA A UAAAGCUU 


6666 


AAGCTTTA GGCTAGCTACAACGA TGAGGCTT 


6759 


GGAGAGAG A UGGGUGCG 


6667 


CGCACCCA GGCTAGCTACAACGA CTCTCTCC 


6760 


GACGCAGG A CUCGGCUU 


6668 


AAGCCGAG GGCTAGCTACAACGA CCTGCGTC 


6761 



Input Sequence = HIV1 . Cut Site = R/Y 

Arm Length = 8 . Core Sequence = GGCTAGCTACAACGA 

HIV1 Consensus 
Table XI: Human HIV Enzymatic Nucleic Acid and Target molecules 



Target 


Seq ID 


RPI# 


Enzymatic Nucleic Acid 


Seq ID 


GAGAUGG G UGCGAGA 


6718 


25003 


ucucgca GGCTAGCTACAACGA ccaucuc B 


6790 


AUGGAAA A CAGAUGG 


6719 


25004 


ccaucug GGCTAGCTACAACGA uuuccau B 


6791 


AAAACAG A UGGCAGG 


6720 


25005 


ccugcca GGCTAGCTACAACGA cuguuuu B 


6792 


AGCCUCA A UAAAGCU 


6721 


25006 


agcuuua GGCTAGCTACAACGA ugaggcu B 


6793 


GAGAGAG A UGGGUGC 


6722 


25007 


gcaccca GGCTAGCTACAACGA cucucuc B 


6794 


CAAUAAA G CUUGCCU 


6723 


25008 


aggcaag gccgaaaggCgagugaGGuCu uuuauug B 


6795 


GGACUCG G CUUGCUG 


6724 


25009 


cagcaag gccgaaaggCgagugaGGuCu cgagucc B 


6796 


GAGAUGG G UGCGAGA 


6718 


25010 


ucucgca gccgaaaggCgagugaGGuCu ccaucuc B 


6797 


GAUGGGU G CGAGAGC 


6725 


25011 


gcucucg gccgaaaggCgagugaGGuCu acccauc B 


6798 


UCUCGAC G CAGGACU 


6726 


25012 


aguccug gccgaaaggCgagugaGGuCu gucgaga B 


6799 



G = Guanosine 

A, G, C, T (italic) = deoxy 

lower case = 2'-O-methyl 

s = phosphorothioate 3'-internucleotide linkage 

C = 2'-deoxy-2'-amino cytidine 

B = inverted deoxyabasic derivative 
Table XII: Human HIV-1 Sequences 



Genbank 
Acc# 


Con N^mo/c\ 




Organism 

1 tjCLI 1 IOI 1 1 


A04321 


IIIB LAI 


B 


HIV-1 


AF110962 


96BW0402 


C 


HIV-1 


AF110963 


96BW0407 


C 


HIV-1 


AF110968 


96BW0504 


c 


HIV-1 


AF110965 


96BW0409 


c 


HIV-1 


AF110966 


96BW0410 


c 


HIV-1 


AF110964 


96BW0408 


c 


HIV-1 


AF110975 


96BW15C05 


c 


HIV-1 


AF110974 


96BW15C02 


c 


HIV-1 


AF110973 


96BW15B03 


c 


HIV-1 


AF107771 


UGSE8131 


A 


HIV-1 


U69585 


WCIPR854 


B 


HIV-1 


U69588 


WCIPR855 


B 


HIV-1 


U69589 


WCIPR9011 


B 


HIV-1 


U69591 
CLAIMS 

What we claim is: 

1. A siRNA nucleic acid molecule that modulates expression of a nucleic acid 
molecule encoding HER2. 

2. An enzymatic nucleic acid molecule that modulates expression of a nucleic 
acid molecule encoding HER2. 

3. An enzymatic nucleic acid molecule comprising a sequence selected from the 
group consisting of SEQ ID NOs: 5644-6631 and 6637-6641. 

4. An enzymatic nucleic acid molecule comprising at least one binding arm 
wherein one or more of said binding arms comprises a sequence 
complementary to a sequence selected from the group consisting of SEQ ID 
NOs: 4656-5643 and 6632-6636. 

5. A siRNA nucleic acid molecule comprising a sequence complementary to a 
sequence selected from the group consisting of SEQ ID NOs: 4656-5643 and 
6632-6636. 

6. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule is adapted to treat cancer. 

7. The enzymatic nucleic acid molecule of any of claims 2-4, wherein said 
enzymatic nucleic acid molecule has an endonuclease activity to cleave RNA 
having HER2 sequence. 

8. The enzymatic nucleic acid molecule of claim 2, wherein said enzymatic 
nucleic acid molecule is a DNAzyme in a 10-23 configuration. 

9. The enzymatic nucleic acid molecule of claim 8, wherein said enzymatic 
nucleic acid molecule comprises a sequence complementary to a sequence 
selected from the group consisting of SEQ ID NOs: 4656-5643 and 6632- 
6636. 
10. The enzymatic nucleic acid molecule of claim 8, wherein said enzymatic 
nucleic acid molecule comprises a sequence selected from the group 
consisting of SEQ ID NOs: 5644-6631 and 6637-6641. 

11. The nucleic acid molecule of any of claims 1, 2, 4 or 5, wherein said nucleic 
acid molecule comprises between 12 and 100 bases complementary to a 
RNA having HER2 sequence. 

12. The nucleic acid molecule of any of claims 1, 2, 4 or 5, wherein said 
nucleic acid molecule comprises between 14 and 24 bases complementary to 
a RNA having HER2 sequence. 

13. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule is chemically synthesized. 

14. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule comprises at least one 2'-sugar modification. 

15. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule comprises at least one nucleic acid base modification. 

16. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule comprises at least one phosphate backbone modification. 

17. A mammalian cell comprising the nucleic acid molecule of any of claims 1-5. 

18. The mammalian cell of claim 17, wherein said mammalian cell is a human 
cell. 

19. A method of reducing HER2 activity in a cell, comprising contacting said 
cell with the nucleic acid molecule of any of claims 1-5, under conditions 
suitable for said reduction of HER2 activity. 

20. A method of treatment of a subject having a condition associated with the 
level of HER2, comprising contacting cells of said subject with the nucleic 
acid molecule of any of claims 1-5, under conditions suitable for said 
treatment. 

21. The method of claim 20 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment. 
22. A method of cleaving RNA having HER2 sequence comprising contacting 
an enzymatic nucleic acid molecule of any of claims 2-4 with said RNA 
under conditions suitable for the cleavage. 

23. The method of claim 22, wherein said cleavage is carried out in the presence 
of a divalent cation. 

24. The method of claim 23, wherein said divalent cation is Mg2+. 

25. The nucleic acid molecule of any of claims 1-5, wherein said nucleic acid 
molecule comprises a cap structure, wherein the cap structure is at the 5'- 
end, 3'-end, or both the 5'-end and the 3'-end of said nucleic acid molecule. 

26. The nucleic acid molecule of claim 25, wherein the cap structure at the 5'- 
end, 3'-end, or both the 5'-end and the 3'-end comprises a 3',3'-linked or 
5',5'-linked deoxyabasic ribose derivative. 

27. An expression vector comprising a nucleic acid sequence encoding at least 
one nucleic acid molecule of any of claims 1-5 in a manner that allows 
expression of the nucleic acid molecule. 

28. A mammalian cell comprising an expression vector of claim 27. 

29. The mammalian cell of claim 28, wherein said mammalian cell is a human 
cell. 

30. The expression vector of claim 27, wherein said nucleic acid molecule is in a 
DNAzyme configuration. 

31. The expression vector of claim 27, wherein said expression vector further 
comprises a sequence for a nucleic acid molecule complementary to a 
nucleic acid molecule having HER2 sequence. 

32. The expression vector of claim 27, wherein said expression vector comprises 
a nucleic acid sequence encoding two or more of said nucleic acid molecules, 
which may be the same or different. 

33. The expression vector of claim 32, wherein said expression vector further 
comprises a sequence encoding an antisense nucleic acid molecule or siRNA 
molecule complementary to a nucleic acid molecule having HER2 sequence. 
34. A method for treatment of cancer comprising administering to a subject the 
nucleic acid molecule of any of claims 1-5 under conditions suitable for said 
treatment. 

35. The method of claim 34, wherein said cancer is breast cancer. 

36. The method of claim 34, wherein said cancer is ovarian cancer. 

37. The method of claim 34, wherein said method further comprises 
administering to said subject one or more other therapies under conditions 
suitable for said treatment. 

38. The method of claim 21 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 

39. The method of claim 37 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 

40. The method of claim 38, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubicin, fluorouracil, carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

41. The method of claim 38, wherein said monoclonal antibody is Herceptin 
(trastuzumab). 

42. The method of claim 39, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubicin, fluorouracil, carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

43. The method of claim 39, wherein said monoclonal antibody is Herceptin 
(trastuzumab). 

44. A composition comprising a nucleic acid molecule of any of claims 1-5 in a 
pharmaceutically acceptable carrier. 
45. A method of administering to a cell a nucleic acid molecule of any of claims 
1-5 comprising contacting said cell with the nucleic acid molecule under 
conditions suitable for said administration. 

46. The method of claim 45, wherein said cell is a mammalian cell. 

47. The method of claim 45, wherein said cell is a human cell. 

48. The method of claim 45, wherein said administration is in the presence of a 
delivery reagent. 

49. The method of claim 48, wherein said delivery reagent is a lipid. 

50. The method of claim 49, wherein said lipid is a cationic lipid. 

51. The method of claim 49, wherein said lipid is a phospholipid. 

52. The method of claim 48, wherein said delivery reagent is a liposome. 

53. A siRNA nucleic acid molecule that modulates expression of a nucleic acid 
molecule encoding K-Ras. 

54. A siRNA nucleic acid molecule that modulates expression of a nucleic acid 
molecule encoding H-Ras or N-Ras. 

55. An enzymatic nucleic acid molecule that modulates expression of a nucleic 
acid molecule encoding K-Ras. 

56. An enzymatic nucleic acid molecule that modulates expression of a nucleic 
acid molecule encoding H-Ras or N-Ras. 

57. An enzymatic nucleic acid molecule comprising a sequence of SEQ ID NOs: 
2329-4655. 

58. An enzymatic nucleic acid molecule comprising at least one binding arm 
wherein one or more of said binding arms comprises a sequence 
complementary to a sequence of SEQ ID NOs: 1-2328. 



59. A siRNA nucleic acid molecule comprising a sequence complementary to a 
sequence of SEQ ID NOs: 1-2328. 
60. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule is adapted to treat cancer. 

61. The enzymatic nucleic acid molecule of any of claims 55, 57 or 58, wherein 
said enzymatic nucleic acid molecule has an endonuclease activity to cleave 
RNA having a K-Ras sequence. 

62. The enzymatic nucleic acid molecule of any of claims 56-58, wherein said 
enzymatic nucleic acid molecule has an endonuclease activity to cleave RNA 
having an H-Ras sequence. 

63. The enzymatic nucleic acid molecule of claim 55 or claim 56, wherein said 
enzymatic nucleic acid molecule is a DNAzyme in a 10-23 configuration. 

64. The enzymatic nucleic acid molecule of claim 63, wherein said enzymatic 
nucleic acid molecule comprises a sequence complementary to a sequence of 
SEQ ID NOs: 1-2328. 

65. The enzymatic nucleic acid molecule of claim 63, wherein said enzymatic 
nucleic acid molecule comprises a sequence of SEQ ID NOs: 2329-4655. 

66. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises between 12 and 100 bases complementary to an RNA 
having K-Ras, H-Ras and/or N-Ras sequence. 

67. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises between 14 and 24 bases complementary to an RNA 
having K-Ras, H-Ras and/or N-Ras sequence. 

68. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule is chemically synthesized. 

69. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises at least one 2'-sugar modification. 

70. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises at least one nucleic acid base modification. 

71. The nucleic acid molecule of any of claims 53-59, wherein said enzymatic 
nucleic acid molecule comprises at least one phosphate backbone 
modification. 
72. A mammalian cell comprising the nucleic acid molecule of any of claims 
53-59. 

73. The mammalian cell of claim 72, wherein said mammalian cell is a human 
cell. 

74. A method of reducing K-Ras activity in a cell, comprising contacting said 
cell with the nucleic acid molecule of any of claims 53, 55, 57, 58 or 59, 
under conditions suitable for said reduction of K-Ras activity. 

75. A method of reducing H-Ras activity in a cell, comprising contacting said 
cell with the nucleic acid molecule of any of claims 54, 56, 57, 58 or 59, 
under conditions suitable for said reduction of H-Ras activity. 

76. A method of treatment of a subject having a condition associated with the 
level of K-Ras, comprising contacting cells of said subject with the nucleic 
acid molecule of any of claims 53, 55, 57, 58 or 59, under conditions suitable 
for said treatment. 

77. A method of treatment of a subject having a condition associated with the 
level of H-Ras, comprising contacting cells of said subject with the nucleic 
acid molecule of any of claims 54, 56, 57, 58 or 59, under conditions suitable 
for said treatment. 

78. The method of claim 76 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment. 

79. The method of claim 77 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment. 

80. A method of cleaving RNA having a K-Ras sequence comprising contacting 
a nucleic acid molecule of any of claims 53, 55, 57, 58 or 59, with said 
RNA under conditions suitable for the cleavage. 

81. A method of cleaving RNA having a H-Ras sequence comprising contacting 
a nucleic acid molecule of any of claims 54, 56, 57, 58 or 59, with said 
RNA under conditions suitable for the cleavage. 

82. The method of claim 80, wherein said cleavage is carried out in the presence 
of a divalent cation. 
83. The method of claim 81, wherein said cleavage is carried out in the presence 
of a divalent cation. 

84. The method of claim 82, wherein said divalent cation is Mg2+. 

85. The method of claim 83, wherein said divalent cation is Mg2+. 

86. The nucleic acid molecule of any of claims 53-59, wherein said nucleic acid 
molecule comprises a cap structure, wherein the cap structure is at the 5'- 
end, 3'-end, or both the 5'-end and the 3'-end of said nucleic acid molecule. 

87. The nucleic acid molecule of claim 86, wherein the cap structure comprises 
a 3',3'-linked or 5',5'-linked deoxyabasic ribose derivative. 

88. An expression vector comprising a nucleic acid sequence encoding at least 
one nucleic acid molecule of any of claims 53-59 in a manner that allows 
expression of the nucleic acid molecule. 

89. A mammalian cell comprising an expression vector of claim 88. 

90. The mammalian cell of claim 89, wherein said mammalian cell is a human 
cell. 

91. The expression vector of claim 88, wherein said nucleic acid molecule is in a 
DNAzyme configuration. 

92. The expression vector of claim 88, wherein said expression vector further 
comprises a sequence for a nucleic acid molecule complementary to a 
nucleic acid molecule having a K-Ras sequence. 

93. The expression vector of claim 88, wherein said expression vector further 
comprises a sequence for a nucleic acid molecule complementary to a 
nucleic acid molecule having a H-Ras sequence. 

94. The expression vector of claim 88, wherein said expression vector comprises 
a nucleic acid sequence encoding two or more of said nucleic acid molecules, 
which may be the same or different. 

95. The expression vector of claim 88, wherein said expression vector further 
comprises a sequence encoding an antisense nucleic acid molecule or siRNA 
nucleic acid molecule complementary to a nucleic acid molecule having a K- 
Ras sequence. 

96. The expression vector of claim 88, wherein said expression vector further 
comprises a sequence encoding an antisense nucleic acid molecule or siRNA 
nucleic acid molecule complementary to a nucleic acid molecule having a H- 
Ras sequence. 

97. A method for the treatment of cancer comprising administering to a subject 
the nucleic acid molecule of any of claims 53-59 under conditions suitable 
for said treatment. 

98. The method of claim 97, wherein said cancer is colorectal cancer. 

99. The method of claim 97, wherein said cancer is lung cancer. 

100. The method of claim 97, wherein said cancer is prostate cancer. 

101. The method of claim 97, wherein said cancer is bladder cancer. 

102. The method of claim 97, wherein said cancer is breast cancer. 

103. The method of claim 97, wherein said cancer is pancreatic cancer. 

104. The method of claim 97, wherein said method further comprises 
administering to said patient one or more other therapies under conditions 
suitable for said treatment. 

105. The method of claim 78 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 

106. The method of claim 79 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 

107. The method of claim 104 wherein said other drug therapies are chosen from 
monoclonal antibody therapy, chemotherapy, radiation therapy, and analgesic 
therapy. 
108. The method of claim 105, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubicin, fluorouracil, carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

109. The method of claim 106, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubicin, fluorouracil, carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

110. The method of claim 107, wherein said chemotherapy is selected from the 
group consisting of paclitaxel (Taxol), docetaxel, cisplatin, methotrexate, 
cyclophosphamide, doxorubicin, fluorouracil, carboplatin, edatrexate, 
gemcitabine, and vinorelbine. 

111. A composition comprising a nucleic acid molecule of any of claims 53-59 
and a pharmaceutically acceptable carrier. 

112. A method of administering to a cell a nucleic acid molecule of any of claims 
53-59 comprising contacting said cell with the enzymatic nucleic acid 
molecule under conditions suitable for said administration. 

113. The method of claim 112, wherein said cell is a mammalian cell. 

114. The method of claim 113, wherein said cell is a human cell. 

115. The method of claim 112, wherein said administration is in the presence of a 
delivery reagent. 

116. The method of claim 115, wherein said delivery reagent is a lipid. 

117. The method of claim 116, wherein said lipid is a cationic lipid. 

118. The method of claim 116, wherein said lipid is a phospholipid. 

119. The method of claim 115, wherein said delivery reagent is a liposome. 

120. A siRNA nucleic acid molecule which modulates expression of a nucleic 
acid molecule encoding HIV or a component of HIV. 
121. An enzymatic nucleic acid molecule which modulates expression of a 
nucleic acid molecule encoding HIV or a component of HIV, wherein said 
enzymatic nucleic acid molecule is in an Inozyme, G-cleaver, Zinzyme or 
Amberzyme configuration. 

122. An enzymatic nucleic acid molecule comprising a sequence selected from the 
group consisting of SEQ ID NOs. 6727-6799. 

123. An enzymatic nucleic acid molecule comprising at least one binding arm 
wherein one or more of said binding arms comprises a sequence 
complementary to a sequence selected from the group consisting of SEQ ID 
NOs. 6642-6726. 

124. A siRNA nucleic acid molecule comprising a sequence complementary to a 
sequence selected from the group consisting of SEQ ID NOs. 6642-6726. 

125. The nucleic acid of any of claims 120-124, wherein said nucleic acid 
molecule is adapted to treat HIV infection or acquired immunodeficiency 
syndrome (AIDS). 

126. The enzymatic nucleic acid molecule of any of claims 121-123, wherein said 
enzymatic nucleic acid molecule has an endonuclease activity to cleave RNA 
having a HIV sequence. 

127. The enzymatic nucleic acid molecule of claim 121, wherein said enzymatic 
nucleic acid molecule is in an Inozyme configuration. 

128. The enzymatic nucleic acid molecule of claim 121, wherein said enzymatic 
nucleic acid molecule is in a Zinzyme configuration. 

129. The enzymatic nucleic acid molecule of claim 121, wherein said enzymatic 
nucleic acid molecule is in a G-cleaver configuration. 

130. The enzymatic nucleic acid molecule of claim 121, wherein said enzymatic 
nucleic acid molecule is in an Amberzyme configuration. 

131. The enzymatic nucleic acid molecule of claim 123, wherein said enzymatic 
nucleic acid molecule is in a DNAzyme configuration. 

132. The enzymatic nucleic acid molecule of claim 123, wherein said enzymatic 
nucleic acid molecule is in a Hammerhead configuration. 

133. The enzymatic nucleic acid molecule of claim 127, wherein said Inozyme
comprises a sequence complementary to a sequence selected from the group 
consisting of SEQ ID NOs. 6648-6655. 

134. The enzymatic nucleic acid molecule of claim 127, wherein said Inozyme 
comprises a sequence selected from the group consisting of SEQ ID NOs. 
6733-6740. 

135. The enzymatic nucleic acid molecule of claim 128, wherein said Zinzyme 
comprises a sequence complementary to a sequence selected from the group 
consisting of SEQ ID NOs. 6656-6663 and 6723-6726. 

136. The enzymatic nucleic acid molecule of claim 128, wherein said Zinzyme 
comprises a sequence selected from the group consisting of SEQ ID NOs. 
6741-6748 and 6795-6799. 

137. The enzymatic nucleic acid molecule of claim 130, wherein said Amberzyme 
comprises a sequence complementary to a sequence selected from the group 
consisting of SEQ ID NOs. 6656-6688. 

138. The enzymatic nucleic acid molecule of claim 130, wherein said Amberzyme 
comprises a sequence selected from the group consisting of SEQ ID NOs. 
6762-6789. 

139. The enzymatic nucleic acid molecule of claim 131, wherein said DNAzyme 
comprises a sequence complementary to a sequence selected from the group 
consisting of SEQ ID NOs. 6656-6668 and 6718-6722. 

140. The enzymatic nucleic acid molecule of claim 131, wherein said DNAzyme 
comprises a sequence selected from the group consisting of SEQ ID NOs. 
6749-6761 and 6790-6794. 

141. The enzymatic nucleic acid molecule of claim 132, wherein said 
Hammerhead comprises a sequence complementary to a sequence selected 
from the group consisting of SEQ ID NOs. 6642-6647. 

142. The enzymatic nucleic acid molecule of claim 132, wherein said 
Hammerhead comprises a sequence selected from the group consisting of 
SEQ ID NOs. 6727-6732.

143. The nucleic acid molecule of any of claims 120-124, wherein said nucleic
acid molecule comprises between 12 and 100 bases complementary to a 
nucleic acid molecule encoding HIV. 

144. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises between 14 and 24 bases complementary to a 
nucleic acid molecule encoding HIV. 

145. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule is chemically synthesized. 

146. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises at least one 2'-sugar modification.

147. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises at least one nucleic acid base modification. 

148. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises at least one phosphate backbone modification. 

149. A mammalian cell comprising the nucleic acid molecule of any of claims 
120-124.

150. The mammalian cell of claim 149, wherein said mammalian cell is a human 
cell. 

151. A method of reducing HIV activity in a cell, comprising contacting said cell 
with the nucleic acid molecule of any of claims 120-124, under conditions 
suitable for said reduction of HIV activity. 

152. A method of treatment of a subject having a condition associated with the 
level of HIV, comprising contacting cells of said subject with the nucleic 
acid molecule of any of claims 120-124, under conditions suitable for said 
treatment. 

153. The method of claim 151 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment. 

154. The method of claim 152 further comprising the use of one or more drug 
therapies under conditions suitable for said treatment. 

155. A method of cleaving RNA of an HIV gene comprising contacting an
enzymatic nucleic acid molecule of any of claims 121-123 with said RNA of 
a HIV gene under conditions suitable for the cleavage. 

156. The method of claim 155, wherein said cleavage is carried out in the 
presence of a divalent cation. 

157. The method of claim 156, wherein said divalent cation is Mg2+.

158. The nucleic acid molecule of any of claims 120-124, wherein said nucleic 
acid molecule comprises a cap structure, wherein the cap structure is at the 
5'-end, 3'-end, or both the 5'-end and the 3'-end of said nucleic acid
molecule. 

159. The nucleic acid molecule of claim 158, wherein the cap structure at the
5'-end, 3'-end, or both the 5'-end and the 3'-end comprises a 3',3'-linked or
5',5'-linked deoxyabasic ribose derivative.

160. An expression vector comprising a nucleic acid sequence encoding at least 
one nucleic acid molecule of any of claims 120-124 in a manner which 
allows expression of the nucleic acid molecule. 

161. A mammalian cell comprising an expression vector of claim 160.

162. The mammalian cell of claim 161, wherein said mammalian cell is a human 
cell. 

163. An expression vector comprising a nucleic acid sequence encoding at least 
one nucleic acid molecule of any of claims 122 or 123 in a manner which 
allows expression of the nucleic acid molecule, wherein said nucleic acid 
molecule is in a hammerhead configuration. 

164. The expression vector of claim 160, wherein said expression vector further 
comprises a sequence for a nucleic acid molecule complementary to the 
RNA of HIV. 

165. The expression vector of claim 160, wherein said expression vector 
comprises a nucleic acid sequence encoding two or more of said nucleic acid 
molecules, which may be the same or different. 

166. The expression vector of claim 165, wherein said expression vector further
comprises a sequence encoding a siRNA nucleic acid molecule 
complementary to the RNA of HIV gene. 

167. A method for treatment of acquired immunodeficiency syndrome (AIDS) or 
an AIDS related condition comprising administering to a subject the nucleic 
acid molecule of any of claims 120-124 under conditions suitable for said 
treatment. 

168. The method of claim 167, wherein said AIDS related condition is Kaposi's 
sarcoma, lymphoma, cervical cancer, squamous cell carcinoma, cardiac 
myopathy, rheumatic disease, or opportunistic infection. 

169. The method of claim 167, wherein said method further comprises 
administering to said subject one or more other therapies. 

170. The nucleic acid molecule of claim 121 or claim 123, wherein said nucleic 
acid molecule comprises at least five ribose residues, at least ten 2'-O-methyl
modifications, and a 3'-end modification.

171. The nucleic acid molecule of claim 170, wherein said nucleic acid molecule 
further comprises phosphorothioate linkages on at least three of the 5'
terminal nucleotides. 

172. The nucleic acid molecule of claim 170, wherein said 3'-end modification is
a 3'-3' inverted abasic moiety.

173. The method of claim 153 wherein said other drug therapies are chosen from
antiviral therapy, monoclonal antibody therapy, chemotherapy, radiation 
therapy, analgesic therapy, and anti-inflammatory therapy. 

174. The method of claim 173, wherein said antiviral therapy is chosen from 
treatment with AZT, ddC, ddI, d4T, 3TC, Ribavirin, delavirdine, nevirapine,
efavirenz, ritonavir, saquinavir, indinavir, amprenavir, nelfinavir, and
lopinavir. 

175. The method of claim 154 wherein said other drug therapies are chosen from 
antiviral therapy, monoclonal antibody therapy, chemotherapy, radiation 
therapy, analgesic therapy, and anti-inflammatory therapy. 

176. The method of claim 175, wherein said antiviral therapy is chosen from
treatment with AZT, ddC, ddI, d4T, 3TC, Ribavirin, delavirdine, nevirapine,
efavirenz, ritonavir, saquinavir, indinavir, amprenavir, nelfinavir, and
lopinavir. 

177. The method of claim 169 wherein said other drug therapies are chosen from 
antiviral therapy, monoclonal antibody therapy, chemotherapy, radiation 
therapy, analgesic therapy, and anti-inflammatory therapy. 

178. The method of claim 177, wherein said antiviral therapy is chosen from 
treatment with AZT, ddC, ddI, d4T, 3TC, Ribavirin, delavirdine, nevirapine,
efavirenz, ritonavir, saquinavir, indinavir, amprenavir, nelfinavir, and
lopinavir. 

179. A pharmaceutical composition comprising a nucleic acid molecule of any of 
claims 120-124 in a pharmaceutically acceptable carrier. 

180. The nucleic acid molecule of claim 120 or 121, wherein said component of
HIV is nef.

181. The nucleic acid molecule of claim 120 or 121, wherein said component of 
HIV is vif.

182. The nucleic acid molecule of claim 120 or 121, wherein said component of 
HIV is tat.

183. The nucleic acid molecule of claim 120 or 121, wherein said component of 
HIV is rev.

184. The nucleic acid molecule of claim 120 or 121, wherein said component of 
HIV is LTR.

185. The nucleic acid molecule of claim 184, wherein said LTR is the 3'-LTR. 

186. The nucleic acid molecule of claim 184, wherein said LTR is the 5'-LTR. 

187. A method of administering to a cell a nucleic acid molecule of any of claims 
120-124 comprising contacting said cell with the nucleic acid molecule 
under conditions suitable for said administration. 

188. The method of claim 187, wherein said cell is a mammalian cell.

189. The method of claim 187, wherein said cell is a human cell.

190. The method of claim 187, wherein said administration is in the presence of a 
delivery reagent. 

191. The method of claim 190, wherein said delivery reagent is a lipid.

192. The method of claim 191, wherein said lipid is a cationic lipid. 

193. The method of claim 191, wherein said lipid is a phospholipid.

194. The method of claim 190, wherein said delivery reagent is a liposome. 